Phage microarray profiling of the humoral response to disease

ABSTRACT

The present invention relates to compositions and methods for cancer diagnostics, including but not limited to, cancer markers. In particular, the present invention provides methods and compositions for phage microarray profiling of cancer (e.g., prostate, lung, or breast cancer). The present invention further provides novel markers useful for the diagnosis, characterization, and treatment of cancers.

This application claims priority to provisional application Ser. No.60/578,406, filed Jun. 9, 2004, which is herein incorporated byreference in its entirety.

The present invention was funded in part by NIH grant PSOCA69568. Thegovernment has certain rights in the invention.

FIELD OF THE INVENTION

The present invention relates to compositions and methods for diseasediagnostics. In particular, the present invention provides methods andcompositions for phage microarray profiling of cancer (e.g., prostate,lung or breast cancer). The present invention further provides novelmarkers useful for the diagnosis, characterization, and treatment ofdisease (e.g., cancers).

BACKGROUND OF THE INVENTION

Afflicting one out of nine men over age 65, prostate cancer (PCA) is aleading cause of male cancer-related death, second only to lung cancer(Abate-Shen and Shen, Genes Dev 14:2410 [2000]; Ruijter et al., EndocrRev, 20:22 [1999]). The American Cancer Society estimates that about184,500 American men will be diagnosed with prostate cancer and 39,200will die in 2001.

Prostate cancer is typically diagnosed with a digital rectal exam and/orprostate specific antigen (PSA) screening. An elevated serum PSA levelcan indicate the presence of PCA. PSA is used as a marker for prostatecancer because it is secreted only by prostate cells. A healthy prostatewill produce a stable amount—typically below 4 nanograms per milliliter,or a PSA reading of “4” or less—whereas cancer cells produce escalatingamounts that correspond with the severity of the cancer. A level between4 and 10 may raise a doctor's suspicion that a patient has prostatecancer, while amounts above 50 may show that the tumor has spreadelsewhere in the body.

When PSA or digital tests indicate a strong likelihood that cancer ispresent, a transrectal ultrasound (TRUS) is used to map the prostate andshow any suspicious areas. Biopsies of various sectors of the prostateare used to determine if prostate cancer is present. Treatment optionsdepend on the stage of the cancer. Men with a 10-year life expectancy orless who have a low Gleason number and whose tumor has not spread beyondthe prostate are often treated with watchful waiting (no treatment).Treatment options for more aggressive cancers include surgicaltreatments such as radical prostatectomy (RP), in which the prostate iscompletely removed (with or without nerve sparing techniques) andradiation, applied through an external beam that directs the dose to theprostate from outside the body or via low-dose radioactive seeds thatare implanted within the prostate to kill cancer cells locally.Anti-androgen hormone therapy is also used, alone or in conjunction withsurgery or radiation. Hormone therapy uses luteinizing hormone-releasinghormones (LH-RH) analogs, which block the pituitary from producinghormones that stimulate testosterone production. Patients must haveinjections of LH-RH analogs for the rest of their lives.

While surgical and hormonal treatments are often effective for localizedPCA, advanced disease remains essentially incurable. Androgen ablationis the most common therapy for advanced PCA, leading to massiveapoptosis of androgen-dependent malignant cells and temporary tumorregression. In most cases, however, the tumor reemerges with a vengeanceand can proliferate independent of androgen signals.

The advent of prostate specific antigen (PSA) screening has led toearlier detection of PCA and significantly reduced PCA-associatedfatalities. However, the impact of PSA screening on cancer-specificmortality is still unknown pending the results of prospective randomizedscreening studies (Etzioni et al., J. Natl. Cancer Inst., 91:1033[1999]; Maattanen et al., Br. J. Cancer 79:1210 [1999]; Schroder et al.,J. Natl. Cancer Inst., 90:1817 [1998]). A major limitation of the serumPSA test is a lack of prostate cancer sensitivity and specificityespecially in the intermediate range of PSA detection (4-10 ng/ml).Elevated serum PSA levels are often detected in patients withnon-malignant conditions such as benign prostatic hyperplasia (BPH) andprostatitis, and provide little information about the aggressiveness ofthe cancer detected. Coincident with increased serum PSA testing, therehas been a dramatic increase in the number of prostate needle biopsiesperformed (Jacobsen et al., JAMA 274:1445 [1995]). This has resulted ina surge of equivocal prostate needle biopsies (Epstein and Potter J.Urol., 166:402 [2001]). Thus, development of additional serum and tissuebiomarkers to supplement or replace PSA screening is needed.

SUMMARY OF THE INVENTION

The present invention relates to compositions and methods for diseasediagnostics. In particular, the present invention provides methods andcompositions for phage microarray profiling of cancer (e.g., prostate,breast, or lung cancer). The present invention further provides novelmarkers useful for the diagnosis, characterization, and treatment ofdisease (e.g., cancers).

Accordingly, in some embodiments, the present invention provides amethod, comprising: providing a phage library, wherein the phage librarycomprises a plurality of phage clones, each of the phage clonescomprising a cDNA obtained from a disease (e.g., cancer, autoimmunedisease, inflammatory disease, cardiovascular disease and diabetes) mRNAsample; enriching the phage library for phage clones comprising cDNAsspecific to the disease, where the enriching comprises binding the phagelibrary to a control IgG to remove non-disease specific phage clonesfollowed by binding the phage library to a disease specific IgG toenrich the phage library for disease specific phage clones, therebygenerating an enriched phage library; exposing the enriched phagelibrary to serum from disease patients and optionally serum fromnon-diseased control subjects to generate a immunoglobulin bound phagelibrary; and identifying phage clones that react with the serum from thedisease patients. In some embodiments, the method further comprises thestep of identifying phage clones that react with serum from the diseasesubjects, but not with the serum from non-diseased control subjects. Insome embodiments, the identifying comprises contacting theimmunoglobulin bound phage library with a first immunoglobulin thatbinds to immunoglobulins from the serum from patients having the diseaseand a second immunoglobulin that binds to a phage capsid protein. Insome embodiments, the identifying further comprises the step of exposingthe first and second immunoglobulins to third and fourth immunoglobulinswherein the third immunoglobulin binds to the first immunoglobulin andwherein the third immunoglobulin comprises a first label, and whereinthe fourth immunoglobulin binds to the second immunoglobulin and whereinthe fourth immunoglobulin comprises a second label. In some embodiments,the first and second labels are fluorescent dyes and the first labelemits fluorescence at a different wavelength than the second label. Insome embodiments, the method further comprises the step of exposing thelabeled phage library to an image scanner to identify phage clones thatreact with the serum from the disease patients but not with the serumfrom non-diseased control subjects. In some embodiments, the methodfurther comprises the step of determining the identity of genescontained in the phage clones that react with the serum from the diseasepatients but not with the serum from non-diseased control subjects. Insome embodiments, the disease is prostate, lung, or breast cancer. Incertain embodiments, the enriched phage library is arrayed on a solidsurface. In some embodiments, the disease specific IgG is purified fromthe serum of a patient with the disease. In some preferred embodiments,the enriching step is repeated 2 or more, and preferably 5 or moretimes. In preferred embodiments, the disease is cancer and the phageclones that react with the serum from the cancer patients but not withthe serum from non-cancer control subjects comprise cDNAs encoding tumorantigens. In certain embodiments, the present invention provides a tumorantigen identified by the above-described method.

In further embodiments, the present invention provides a method fordetecting cancer (e.g., prostate, breast or lung cancer), comprising:providing a sample (e.g., including, but not limited to, a blood sampleor a tumor sample) from a subject (e.g., a human) suspected of havingcancer; and detecting the presence or absence of a humoral response to atumor antigen (e.g., BRD2, eIF4G1, RPL22, RPL13A, HES1, hypotheticalprotein XP_(—)373908, ubiquilin 1, nucleolar protein 3 (NOL3),alpha-2-glycoprotein 1 or heat shock 70 kDa protein 8 (HSPA70)), therebydetecting cancer. In some embodiments, the detecting comprises exposingthe sample to an antibody and detecting the antibody binding to thetumor antigen. In other embodiments, the detecting comprises detectingthe presence of an autoantibody to the tumor antigen (e.g., by exposingthe sample to an autoantibody specific antibody and detecting theautoantibody specific antibody binding to the antibody). In some furtherembodiments, the method further comprises the step of providing aprognosis to the subject. In some embodiments, the detecting cancerfurther comprises detecting a stage of the cancer or a sub-type of thecancer.

In yet other embodiments, the present invention provides a kit fordetecting the presence of cancer (e.g., prostate, lung or breast cancer)in a subject, comprising: a reagent capable of (e.g., sufficient to)specifically detecting the presence of a tumor antigen (e.g., BRD2,eIF4G1, RPL22, RPL13A, HES1, hypothetical protein XP_(—)373908,ubiquilin 1, nucleolar protein 3 (NOL3), alpha-2-glycoprotein 1 or heatshock 70 kDa protein 8 (HSPA70)); and instructions for using the reagentfor detecting the presence of cancer in the subject. In someembodiments, the reagent is a tumor antigen specific antibody. In otherembodiments, the reagent is an antibody specific for an autoantibody tothe tumor antigen. In certain embodiments, the instructions compriseinstructions required by the food and drug administration for labelingof in vitro diagnostics.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 provides a schematic overview of the phage-microarray profilingmethod of some embodiments of the present invention.

FIG. 2 shows supervised analyses and validation of humoral immuneresponse candidates of prostate cancer. FIG. AB shows a ReceiverOperator Characteristic (ROC) curve based on multiplex analysis of the22 epitomic biomarkers. AUC, area under the curve. FIG. 2B showsimmunoreactivity of three representative clones validated by ELISA. FIG.2C shows titration curves of the humoral immune response to arepresentative phage-epitope clone (5′-UTR_BMI1).

FIG. 3 shows a gene expression meta-analysis of humoral immune responsecandidates. FIG. 3A shows a heatmap representation of the humoral immuneresponse for four in frame phage-epitope clones assessed across 129serum samples. FIG. 3B shows the relative gene expression levels of inframe phage-epitope clones assessed using publicly available DNAmicroarray data housed in ONCOMINE. FIG. 3C shows immunoblot validationof the overexpression of humoral response candidates at the proteinlevel in prostate cancer.

FIG. 4 shows a Table of clinical and pathology information of prostatecancer patients used for biopanning and epitope profiling in thetraining cohort of sera.

FIG. 5 shows a Table of clinical and pathology information of prostatecancer patients used for epitope profiling in the validation cohort ofsera.

FIG. 6 shows a Table of Clinical and pathology information ofhormone-refractory prostate cancer patients.

FIG. 7 shows a Table of prediction accuracy of KNN models.

FIG. 8 shows a Table that summarizes class predictions for the trainingsample set.

FIG. 9 shows a Table of class predictions for the independent testingsample set.

FIG. 10 shows a Table of class predictions of prostate cancer sera inwhich PSA levels are less than 4 ng/ml.

FIG. 11 (SEQ ID NOS:400-404) shows a Table of protein sequences ofin-frame phage epitope clones.

FIG. 12 (SEQ ID NOS:405-464) shows a Table of significant protein listfor epitope protein sequence alignment.

FIG. 13 shows a schematic of the approach used to identify epitomicbiomarkers of lung cancer in some embodiments of the present invention.

FIG. 14 shows performance of the immune response profile in the testset.

FIG. 15 shows humoral immune response profiles and patient survival.

FIG. 16 shows characterization of UBQLN1.

DEFINITIONS

To facilitate an understanding of the present invention, a number ofterms and phrases are defined below:

The term “epitope” as used herein refers to that portion of an antigenthat makes contact with a particular antibody.

When a protein or fragment of a protein is used to immunize a hostanimal, numerous regions of the protein may induce the production ofantibodies which bind specifically to a given region orthree-dimensional structure on the protein; these regions or structuresare referred to as “antigenic determinants”. An antigenic determinantmay compete with the intact antigen (i.e., the “immunogen” used toelicit the immune response) for binding to an antibody.

The terms “specific binding” or “specifically binding” when used inreference to the interaction of an antibody and a protein or peptidemeans that the interaction is dependent upon the presence of aparticular structure (i.e., the antigenic determinant or epitope) on theprotein; in other words the antibody is recognizing and binding to aspecific protein structure rather than to proteins in general. Forexample, if an antibody is specific for epitope “A,” the presence of aprotein containing epitope A (or free, unlabelled A) in a reactioncontaining labeled “A” and the antibody will reduce the amount oflabeled A bound to the antibody.

As used herein, the terms “non-specific binding” and “backgroundbinding” when used in reference to the interaction of an antibody and aprotein or peptide refer to an interaction that is not dependent on thepresence of a particular structure (i.e., the antibody is binding toproteins in general rather that a particular structure such as anepitope).

As used herein, the term “subject” refers to any animal (e.g., amammal), including, but not limited to, humans, non-human primates,rodents, and the like, which is to be the recipient of a particulartreatment. Typically, the terms “subject” and “patient” are usedinterchangeably herein in reference to a human subject.

As used herein, the term “subject suspected of having cancer” refers toa subject that presents one or more symptoms indicative of a cancer(e.g., a noticeable lump or mass) or is being screened for a cancer(e.g., during a routine physical). A subject suspected of having cancermay also have one or more risk factors. A subject suspected of havingcancer has generally not been tested for cancer. However, a “subjectsuspected of having cancer” encompasses an individual who has receivedan initial diagnosis (e.g., a CT scan showing a mass or increased PSAlevel) but for whom the stage of cancer is not known. The term furtherincludes people who once had cancer (e.g., an individual in remission).

As used herein, the term “subject at risk for cancer” refers to asubject with one or more risk factors for developing a specific cancer.Risk factors include, but are not limited to, gender, age, geneticpredisposition, environmental expose, previous incidents of cancer,preexisting non-cancer diseases, and lifestyle.

As used herein, the term “characterizing cancer in subject” refers tothe identification of one or more properties of a cancer sample in asubject, including but not limited to, the presence of benign,pre-cancerous or cancerous tissue, the stage of the cancer, and thesubject's prognosis. Cancers may be characterized by the identificationof the expression of one or more cancer marker or tumor antigen genes,including but not limited to, the cancer markers disclosed herein.

As used herein, the term “characterizing prostate tissue in a subject”refers to the identification of one or more properties of a tissuesample (e.g., including but not limited to, the presence of canceroustissue, the presence of pre-cancerous tissue that is likely to becomecancerous, and the presence of cancerous tissue that is likely tometastasize). In some embodiments, tissues are characterized by theidentification of the expression of one or more cancer marker or tumorantigen genes, including but not limited to, the cancer markersdisclosed herein.

As used herein, the term “cancer marker genes” refers to a gene whoseexpression level, alone or in combination with other genes, iscorrelated with cancer or prognosis of cancer. The correlation mayrelate to either an increased or decreased expression of the gene. Forexample, the expression of the gene may be indicative of cancer, or lackof expression of the gene may be correlated with poor prognosis in acancer patient. Cancer marker expression may be characterized using anysuitable method, including but not limited to, those described inillustrative Examples below.

As used herein, the term “a reagent that specifically detects expressionlevels” refers to reagents used to detect the expression of one or moregenes (e.g., including but not limited to, the cancer markers of thepresent invention). Examples of suitable reagents include but are notlimited to, nucleic acid probes capable of specifically hybridizing tothe gene of interest, PCR primers capable of specifically amplifying thegene of interest, and antibodies capable of specifically binding toproteins expressed by the gene of interest. Other non-limiting examplescan be found in the description and examples below.

As used herein, the term “detecting a decreased or increased expressionrelative to non-cancerous control” refers to measuring the level ofexpression of a gene (e.g., the level of mRNA or protein) relative tothe level in a non-cancerous prostate control sample. Gene expressioncan be measured using any suitable method, including but not limited to,those described herein.

As used herein, the term “detecting a change in gene expression in saidcell sample in the presence of said test compound relative to theabsence of said test compound” refers to measuring an altered level ofexpression (e.g., increased or decreased) in the presence of a testcompound relative to the absence of the test compound. Gene expressioncan be measured using any suitable method, including but not limited to,those described herein.

As used herein, the term “tumor antigen” refers to an immunogenicepitope (e.g., protein) expressed by a tumor cell. The protein may beexpressed by non tumor cells but be immunogenic only when expressed by atumor cell. Alternatively, the protein may be expressed by tumor cells,but not normal cells. Exemplary tumor antigens include, but are notlimited to, BRD2, eIF4G1, RPL22, RPL13A, HES1, and hypothetical proteinXP_(—)373908.

As used herein, the term “autoantibody” refers to an antibody producedby a host (with or without immunization) and directed to a host antigen(e.g., a tumor antigen).

As used herein, the term “cancer vaccine” refers to a composition (e.g.,a tumor antigen and a cytokine) that elicits a tumor-specific immuneresponse. The response is elicited from the subject's own immune systemby administering the cancer vaccine composition at a site (e.g., a sitedistant from the tumor). In preferred embodiments, the immune responseresults in the eradication of tumor cells everywhere in the body (e.g.,both primary and metastatic tumor cells).

As used herein, the term “instructions for using said kit for detectingcancer in said subject” includes instructions for using the reagentscontained in the kit for the detection and characterization of cancer ina sample from a subject. In some embodiments, the instructions furthercomprise the statement of intended use required by the U.S. Food andDrug Administration (FDA) in labeling in vitro diagnostic products. Asused herein, the term “cancer expression profile map” refers to apresentation of expression levels of genes in a particular type oftissue (e.g., primary, metastatic, and pre-cancerous tissues). The mapmay be presented as a graphical representation (e.g., on paper or on acomputer screen), a physical representation (e.g., a gel or array) or adigital representation stored in computer memory. Each map correspondsto a particular type of tissue (e.g., primary, metastatic, andpre-cancerous) and thus provides a template for comparison to a patientsample. In preferred embodiments, maps are generated from pooled samplescomprising tissue samples from a plurality of patients with the sametype of tissue.

As used herein, the terms “computer memory” and “computer memory device”refer to any storage media readable by a computer processor. Examples ofcomputer memory include, but are not limited to, RAM, ROM, computerchips, digital video disc (DVDs), compact discs (CDs), hard disk drives(HDD), and magnetic tape.

As used herein, the term “computer readable medium” refers to any deviceor system for storing and providing information (e.g., data andinstructions) to a computer processor. Examples of computer readablemedia include, but are not limited to, DVDs, CDs, hard disk drives,magnetic tape and servers for streaming media over networks.

As used herein, the terms “processor” and “central processing unit” or“CPU” are used interchangeably and refer to a device that is able toread a program from a computer memory (e.g., ROM or other computermemory) and perform a set of steps according to the program.

As used herein, the term “stage of cancer” refers to a qualitative orquantitative assessment of the level of advancement of a cancer.Criteria used to determine the stage of a cancer include, but are notlimited to, the size of the tumor, whether the tumor has spread to otherparts of the body and where the cancer has spread (e.g., within the sameorgan or region of the body or to another organ).

As used herein, the term “providing a prognosis” refers to providinginformation regarding the impact of the presence of cancer (e.g., asdetermined by the diagnostic methods of the present invention) on asubject's future health (e.g., expected morbidity or mortality, thelikelihood of getting cancer, and the risk of metastasis).

As used herein, the term “prostate specific antigen failure” refers tothe development of high prostate specific antigen levels in a patientfollowing prostate cancer therapy (e.g., surgery). As used herein, theterm “risk of developing prostate specific antigen failure” refers to asubject's relative risk (e.g., the percent chance or a relative score)of developing prostate specific antigen failure following prostatecancer therapy.

As used herein, the term “post surgical tumor tissue” refers tocancerous tissue (e.g., prostate tissue) that has been removed from asubject (e.g., during surgery).

As used herein, the term “subject diagnosed with a cancer” refers to asubject who has been tested and found to have cancerous cells. Thecancer may be diagnosed using any suitable method, including but notlimited to, biopsy, x-ray, blood test, and the diagnostic methods of thepresent invention.

As used herein, the term “initial diagnosis” refers to results ofinitial cancer diagnosis (e.g. the presence or absence of cancerouscells). An initial diagnosis does not include information about thestage of the cancer of the risk of prostate specific antigen failure.

As used herein, the term “biopsy tissue” refers to a sample of tissue(e.g., prostate tissue) that is removed from a subject for the purposeof determining if the sample contains cancerous tissue. In someembodiment, biopsy tissue is obtained because a subject is suspected ofhaving cancer. The biopsy tissue is then examined (e.g., by microscopy)for the presence or absence of cancer.

As used herein, the term “inconclusive biopsy tissue” refers to biopsytissue for which histological examination has not determined thepresence or absence of cancer.

As used herein, the term “non-human animals” refers to all non-humananimals including, but are not limited to, vertebrates such as rodents,non-human primates, ovines, bovines, ruminants, lagomorphs, porcines,caprines, equines, canines, felines, aves, etc.

As used herein, the term “disease” refers to any deviation from a normalstate in a subject. In preferred embodiments, the methods andcompositions of the present invention are useful in the diagnosis andtreatment of diseases where the immunological reaction (e.g., generationof immunoglobulins to native proteins) differs in subjects with diseaseand subjects not having disease. The present invention finds use withany number of diseases including, but not limited to, cancer, autoimmunedisease, inflammatory disease, cardiovascular disease and diabetes.

The term “label” as used herein refers to any atom or molecule that canbe used to provide a detectable (preferably quantifiable) effect, andthat can be attached to a nucleic acid or protein. Labels include butare not limited to dyes; radiolabels such as ³²P; binding moieties suchas biotin; haptens such as digoxgenin; luminogenic, phosphorescent orfluorogenic moieties; mass tags; and fluorescent dyes alone or incombination with moieties that can suppress or shift emission spectra byfluorescence resonance energy transfer (FRET). Labels may providesignals detectable by fluorescence, radioactivity, colorimetry,gravimetry, X-ray diffraction or absorption, magnetism, enzymaticactivity, characteristics of mass or behavior affected by mass (e.g.,MALDI time-of-flight mass spectrometry), and the like. A label may be acharged moiety (positive or negative charge) or alternatively, may becharge neutral. Labels can include or consist of nucleic acid or proteinsequence, so long as the sequence comprising the label is detectable.

The term “siRNAs” refers to short interfering RNAs. In some embodiments,siRNAs comprise a duplex, or double-stranded region, of about 18-25nucleotides long; often siRNAs contain from about two to four unpairednucleotides at the 3′ end of each strand. At least one strand of theduplex or double-stranded region of a siRNA is substantially homologousto or substantially complementary to a target RNA molecule. The strandcomplementary to a target RNA molecule is the “antisense strand;” thestrand homologous to the target RNA molecule is the “sense strand,” andis also complementary to the siRNA antisense strand. siRNAs may alsocontain additional sequences; non-limiting examples of such sequencesinclude linking sequences, or loops, as well as stem and other foldedstructures. siRNAs appear to function as key intermediaries intriggering RNA interference in invertebrates and in vertebrates, and intriggering sequence-specific RNA degradation during posttranscriptionalgene silencing in plants.

The term “RNA interference” or “RNAi” refers to the silencing ordecreasing of gene expression by siRNAs. It is the process ofsequence-specific, post-transcriptional gene silencing in animals andplants, initiated by siRNA that is homologous in its duplex region tothe sequence of the silenced gene. The gene may be endogenous orexogenous to the organism, present integrated into a chromosome orpresent in a transfection vector that is not integrated into the genome.The expression of the gene is either completely or partially inhibited.RNAi may also be considered to inhibit the function of a target RNA; thefunction of the target RNA may be complete or partial.

As used herein, the term “gene transfer system” refers to any means ofdelivering a composition comprising a nucleic acid sequence to a cell ortissue. For example, gene transfer systems include, but are not limitedto, vectors (e.g., retroviral, adenoviral, adeno-associated viral, andother nucleic acid-based delivery systems), microinjection of nakednucleic acid, polymer-based delivery systems (e.g., liposome-based andmetallic particle-based systems), biolistic injection, and the like. Asused herein, the term “viral gene transfer system” refers to genetransfer systems comprising viral elements (e.g., intact viruses,modified viruses and viral components such as nucleic acids or proteins)to facilitate delivery of the sample to a desired cell or tissue. Asused herein, the term “adenovirus gene transfer system” refers to genetransfer systems comprising intact or altered viruses belonging to thefamily Adenoviridae.

As used herein, the term “site-specific recombination target sequences”refers to nucleic acid sequences that provide recognition sequences forrecombination factors and the location where recombination takes place.

As used herein, the term “nucleic acid molecule” refers to any nucleicacid containing molecule, including but not limited to, DNA or RNA. Theterm encompasses sequences that include any of the known base analogs ofDNA and RNA including, but not limited to, 4-acetylcytosine,8-hydroxy-N-6-methyladenosine, aziridinylcytosine, pseudoisocytosine,5-(carboxyhydroxylmethyl) uracil, 5-fluorouracil, 5-bromouracil,5-carboxymethylaminomethyl-2-thiouracil,5-carboxymethylaminomethyluracil, dihydrouracil, inosine,N6-isopentenyladenine, 1-methyladenine, 1-methylpseudouracil,1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine,2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-methyladenine,7-methylguanine, 5-methylaminomethyluracil,5-methoxy-aminomethyl-2-thiouracil, beta-D-mannosylqueosine,5′-methoxycarbonylmethyluracil, 5-methoxyuracil,2-methylthio-N-6-isopentenyladenine, uracil-5-oxyacetic acidmethylester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil,queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil,4-thiouracil, 5-methyluracil, N-uracil-5-oxyacetic acid methylester,uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, and2,6-diaminopurine.

The term “gene” refers to a nucleic acid (e.g., DNA) sequence thatcomprises coding sequences necessary for the production of apolypeptide, precursor, or RNA (e.g., rRNA, tRNA). The polypeptide canbe encoded by a full length coding sequence or by any portion of thecoding sequence so long as the desired activity or functional properties(e.g., enzymatic activity, ligand binding, signal transduction,immunogenicity, etc.) of the full-length or fragment are retained. Theterm also encompasses the coding region of a structural gene and thesequences located adjacent to the coding region on both the 5′ and 3′ends for a distance of about 1 kb or more on either end such that thegene corresponds to the length of the full-length mRNA. Sequenceslocated 5′ of the coding region and present on the mRNA are referred toas 5′ non-translated sequences. Sequences located 3′ or downstream ofthe coding region and present on the mRNA are referred to as 3′non-translated sequences. The term “gene” encompasses both cDNA andgenomic forms of a gene. A genomic form or clone of a gene contains thecoding region interrupted with non-coding sequences termed “introns” or“intervening regions” or “intervening sequences.” Introns are segmentsof a gene that are transcribed into nuclear RNA (hnRNA); introns maycontain regulatory elements such as enhancers. Introns are removed or“spliced out” from the nuclear or primary transcript; introns thereforeare absent in the messenger RNA (mRNA) transcript. The mRNA functionsduring translation to specify the sequence or order of amino acids in anascent polypeptide.

As used herein, the term “heterologous gene” refers to a gene that isnot in its natural environment. For example, a heterologous geneincludes a gene from one species introduced into another species. Aheterologous gene also includes a gene native to an organism that hasbeen altered in some way (e.g., mutated, added in multiple copies,linked to non-native regulatory sequences, etc). Heterologous genes aredistinguished from endogenous genes in that the heterologous genesequences are typically joined to DNA sequences that are not foundnaturally associated with the gene sequences in the chromosome or areassociated with portions of the chromosome not found in nature (e.g.,genes expressed in loci where the gene is not normally expressed).

As used herein, the term “gene expression” refers to the process ofconverting genetic information encoded in a gene into RNA (e.g., mRNA,rRNA, tRNA, or snRNA) through “transcription” of the gene (i.e., via theenzymatic action of an RNA polymerase), and for protein encoding genes,into protein through “translation” of mRNA. Gene expression can beregulated at many stages in the process. “Up-regulation” or “activation”refers to regulation that increases the production of gene expressionproducts (i.e., RNA or protein), while “down-regulation” or “repression”refers to regulation that decrease production. Molecules (e.g.,transcription factors) that are involved in up-regulation ordown-regulation are often called “activators” and “repressors,”respectively.

In addition to containing introns, genomic forms of a gene may alsoinclude sequences located on both the 5′ and 3′ end of the sequencesthat are present on the RNA transcript. These sequences are referred toas “flanking” sequences or regions (these flanking sequences are located5′ or 3′ to the non-translated sequences present on the mRNAtranscript). The 5′ flanking region may contain regulatory sequencessuch as promoters and enhancers that control or influence thetranscription of the gene. The 3′ flanking region may contain sequencesthat direct the termination of transcription, post-transcriptionalcleavage and polyadenylation.

The term “wild-type” refers to a gene or gene product isolated from anaturally occurring source. A wild-type gene is that which is mostfrequently observed in a population and is thus arbitrarily designed the“normal” or “wild-type” form of the gene. In contrast, the term“modified” or “mutant” refers to a gene or gene product that displaysmodifications in sequence and or functional properties (i.e., alteredcharacteristics) when compared to the wild-type gene or gene product. Itis noted that naturally occurring mutants can be isolated; these areidentified by the fact that they have altered characteristics (includingaltered nucleic acid sequences) when compared to the wild-type gene orgene product.

As used herein, the terms “nucleic acid molecule encoding,” “DNAsequence encoding,” and “DNA encoding” refer to the order or sequence ofdeoxyribonucleotides along a strand of deoxyribonucleic acid. The orderof these deoxyribonucleotides determines the order of amino acids alongthe polypeptide (protein) chain. The DNA sequence thus codes for theamino acid sequence.

As used herein, the terms “an oligonucleotide having a nucleotidesequence encoding a gene” and “polynucleotide having a nucleotidesequence encoding a gene,” means a nucleic acid sequence comprising thecoding region of a gene or in other words the nucleic acid sequence thatencodes a gene product. The coding region may be present in a cDNA,genomic DNA or RNA form. When present in a DNA form, the oligonucleotideor polynucleotide may be single-stranded (i.e., the sense strand) ordouble-stranded. Suitable control elements such as enhancers/promoters,splice junctions, polyadenylation signals, etc. may be placed in closeproximity to the coding region of the gene if needed to permit properinitiation of transcription and/or correct processing of the primary RNAtranscript. Alternatively, the coding region utilized in the expressionvectors of the present invention may contain endogenousenhancers/promoters, splice junctions, intervening sequences,polyadenylation signals, etc. or a combination of both endogenous andexogenous control elements.

As used herein, the term “oligonucleotide,” refers to a short length ofsingle-stranded polynucleotide chain. Oligonucleotides are typicallyless than 200 residues long (e.g., between 15 and 100), however, as usedherein, the term is also intended to encompass longer polynucleotidechains. Oligonucleotides are often referred to by their length. Forexample a 24 residue oligonucleotide is referred to as a “24-mer”.Oligonucleotides can form secondary and tertiary structures byself-hybridizing or by hybridizing to other polynucleotides. Suchstructures can include, but are not limited to, duplexes, hairpins,cruciforms, bends, and triplexes.

As used herein, the terms “complementary” or “complementarity” are usedin reference to polynucleotides (i.e., a sequence of nucleotides)related by the base-pairing rules. For example, for the sequence“A-G-T,” is complementary to the sequence “T-C-A.” Complementarity maybe “partial,” in which only some of the nucleic acids' bases are matchedaccording to the base pairing rules. Or, there may be “complete” or“total” complementarity between the nucleic acids. The degree ofcomplementarity between nucleic acid strands has significant effects onthe efficiency and strength of hybridization between nucleic acidstrands. This is of particular importance in amplification reactions, aswell as detection methods that depend upon binding between nucleicacids.

The term “homology” refers to a degree of complementarity. There may bepartial homology or complete homology (i.e., identity). A partiallycomplementary sequence is a nucleic acid molecule that at leastpartially inhibits a completely complementary nucleic acid molecule fromhybridizing to a target nucleic acid is “substantially homologous.” Theinhibition of hybridization of the completely complementary sequence tothe target sequence may be examined using a hybridization assay(Southern or Northern blot, solution hybridization and the like) underconditions of low stringency. A substantially homologous sequence orprobe will compete for and inhibit the binding (i.e., the hybridization)of a completely homologous nucleic acid molecule to a target underconditions of low stringency. This is not to say that conditions of lowstringency are such that non-specific binding is permitted; lowstringency conditions require that the binding of two sequences to oneanother be a specific (i.e., selective) interaction. The absence ofnon-specific binding may be tested by the use of a second target that issubstantially non-complementary (e.g., less than about 30% identity); inthe absence of non-specific binding the probe will not hybridize to thesecond non-complementary target.

When used in reference to a double-stranded nucleic acid sequence suchas a cDNA or genomic clone, the term “substantially homologous” refersto any probe that can hybridize to either or both strands of thedouble-stranded nucleic acid sequence under conditions of low stringencyas described above.

A gene may produce multiple RNA species that are generated bydifferential splicing of the primary RNA transcript. cDNAs that aresplice variants of the same gene will contain regions of sequenceidentity or complete homology (representing the presence of the sameexon or portion of the same exon on both cDNAs) and regions of completenon-identity (for example, representing the presence of exon “A” on cDNA1 wherein cDNA 2 contains exon “B” instead). Because the two cDNAscontain regions of sequence identity they will both hybridize to a probederived from the entire gene or portions of the gene containingsequences found on both cDNAs; the two splice variants are thereforesubstantially homologous to such a probe and to each other.

When used in reference to a single-stranded nucleic acid sequence, theterm “substantially homologous” refers to any probe that can hybridize(i.e., it is the complement of) the single-stranded nucleic acidsequence under conditions of low stringency as described above.

As used herein, the term “hybridization” is used in reference to thepairing of complementary nucleic acids. Hybridization and the strengthof hybridization (i.e., the strength of the association between thenucleic acids) is impacted by such factors as the degree ofcomplementary between the nucleic acids, stringency of the conditionsinvolved, the T_(m) of the formed hybrid, and the G:C ratio within thenucleic acids. A single molecule that contains pairing of complementarynucleic acids within its structure is said to be “self-hybridized.”

As used herein, the term “T_(m)” is used in reference to the “meltingtemperature.” The melting temperature is the temperature at which apopulation of double-stranded nucleic acid molecules becomes halfdissociated into single strands. The equation for calculating the T_(m)of nucleic acids is well known in the art. As indicated by standardreferences, a simple estimate of the T_(m) value may be calculated bythe equation: T_(m)=81.5+0.41(% G+C), when a nucleic acid is in aqueoussolution at 1 M NaCl (See e.g., Anderson and Young, Quantitative FilterHybridization, in Nucleic Acid Hybridization [1985]). Other referencesinclude more sophisticated computations that take structural as well assequence characteristics into account for the calculation of T_(m).

As used herein the term “stringency” is used in reference to theconditions of temperature, ionic strength, and the presence of othercompounds such as organic solvents, under which nucleic acidhybridizations are conducted. Under “low stringency conditions” anucleic acid sequence of interest will hybridize to its exactcomplement, sequences with single base mismatches, closely relatedsequences (e.g., sequences with 90% or greater homology), and sequenceshaving only partial homology (e.g., sequences with 50-90% homology).Under “medium stringency conditions,” a nucleic acid sequence ofinterest will hybridize only to its exact complement, sequences withsingle base mismatches, and closely relation sequences (e.g., 90% orgreater homology). Under “high stringency conditions,” a nucleic acidsequence of interest will hybridize only to its exact complement, and(depending on conditions such a temperature) sequences with single basemismatches. In other words, under conditions of high stringency thetemperature can be raised so as to exclude hybridization to sequenceswith single base mismatches.

“High stringency conditions” when used in reference to nucleic acidhybridization comprise conditions equivalent to binding or hybridizationat 42° C. in a solution consisting of 5×SSPE (43.8 g/l NaCl, 6.9 g/lNaH₂PO₄H₂O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS,5× Denhardt's reagent and 100 μg/ml denatured salmon sperm DNA followedby washing in a solution comprising 0.1×SSPE, 1.0% SDS at 42° C. when aprobe of about 500 nucleotides in length is employed.

“Medium stringency conditions” when used in reference to nucleic acidhybridization comprise conditions equivalent to binding or hybridizationat 42° C. in a solution consisting of 5×SSPE (43.8 g/l NaCl, 6.9 g/lNaH₂PO₄H₂O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS,5× Denhardt's reagent and 100 μg/ml denatured salmon sperm DNA followedby washing in a solution comprising 1.0×SSPE, 1.0% SDS at 42° C. when aprobe of about 500 nucleotides in length is employed.

“Low stringency conditions” comprise conditions equivalent to binding orhybridization at 42° C. in a solution consisting of 5×SSPE (43.8 g/lNaCl, 6.9 g/l NaH₂PO₄H₂O and 1.85 g/l EDTA, pH adjusted to 7.4 withNaOH), 0.1% SDS, 5× Denhardt's reagent [50× Denhardt's contains per 500ml: 5 g Ficoll (Type 400, Pharamcia), 5 g BSA (Fraction V; Sigma)] and100 μg/ml denatured salmon sperm DNA followed by washing in a solutioncomprising 5×SSPE, 0.1% SDS at 42° C. when a probe of about 500nucleotides in length is employed.

The art knows well that numerous equivalent conditions may be employedto comprise low stringency conditions; factors such as the length andnature (DNA, RNA, base composition) of the probe and nature of thetarget (DNA, RNA, base composition, present in solution or immobilized,etc.) and the concentration of the salts and other components (e.g., thepresence or absence of formamide, dextran sulfate, polyethylene glycol)are considered and the hybridization solution may be varied to generateconditions of low stringency hybridization different from, butequivalent to, the above listed conditions. In addition, the art knowsconditions that promote hybridization under conditions of highstringency (e.g., increasing the temperature of the hybridization and/orwash steps, the use of formamide in the hybridization solution, etc.)(see definition above for “stringency”).

“Amplification” is a special case of nucleic acid replication involvingtemplate specificity. It is to be contrasted with non-specific templatereplication (i.e., replication that is template-dependent but notdependent on a specific template). Template specificity is heredistinguished from fidelity of replication (i.e., synthesis of theproper polynucleotide sequence) and nucleotide (ribo- or deoxyribo-)specificity. Template specificity is frequently described in terms of“target” specificity. Target sequences are “targets” in the sense thatthey are sought to be sorted out from other nucleic acid. Amplificationtechniques have been designed primarily for this sorting out.

Template specificity is achieved in most amplification techniques by thechoice of enzyme. Amplification enzymes are enzymes that, underconditions they are used, will process only specific sequences ofnucleic acid in a heterogeneous mixture of nucleic acid. For example, inthe case of Qβ replicase, MDV-1 RNA is the specific template for thereplicase (Kacian et al., Proc. Natl. Acad. Sci. USA 69:3038 [1972]).Other nucleic acids will not be replicated by this amplification enzyme.Similarly, in the case of T7 RNA polymerase, this amplification enzymehas a stringent specificity for its own promoters (Chamberlin et al.,Nature 228:227 [1970]). In the case of T4 DNA ligase, the enzyme willnot ligate the two oligonucleotides or polynucleotides, where there is amismatch between the oligonucleotide or polynucleotide substrate and thetemplate at the ligation junction (Wu and Wallace, Genomics 4:560[1989]). Finally, Taq and Pfu polymerases, by virtue of their ability tofunction at high temperature, are found to display high specificity forthe sequences bounded and thus defined by the primers; the hightemperature results in thermodynamic conditions that favor primerhybridization with the target sequences and not hybridization withnon-target sequences (H. A. Erlich (ed.), PCR Technology, Stockton Press[1989]).

As used herein, the term “amplifiable nucleic acid” is used in referenceto nucleic acids that may be amplified by any amplification method. Itis contemplated that “amplifiable nucleic acid” will usually comprise“sample template.”

As used herein, the term “sample template” refers to nucleic acidoriginating from a sample that is analyzed for the presence of “target.”In contrast, “background template” is used in reference to nucleic acidother than sample template that may or may not be present in a sample.Background template is most often inadvertent. It may be the result ofcarryover, or it may be due to the presence of nucleic acid contaminantssought to be purified away from the sample. For example, nucleic acidsfrom organisms other than those to be detected may be present asbackground in a test sample.

As used herein, the term “primer” refers to an oligonucleotide, whetheroccurring naturally as in a purified restriction digest or producedsynthetically, that is capable of acting as a point of initiation ofsynthesis when placed under conditions in which synthesis of a primerextension product that is complementary to a nucleic acid strand isinduced, (i.e., in the presence of nucleotides and an inducing agentsuch as DNA polymerase and at a suitable temperature and pH). The primeris preferably single stranded for maximum efficiency in amplification,but may alternatively be double stranded. If double stranded, the primeris first treated to separate its strands before being used to prepareextension products. Preferably, the primer is anoligodeoxyribonucleotide. The primer must be sufficiently long to primethe synthesis of extension products in the presence of the inducingagent. The exact lengths of the primers will depend on many factors,including temperature, source of primer and the use of the method.

As used herein, the term “probe” refers to an oligonucleotide (i.e., asequence of nucleotides), whether occurring naturally as in a purifiedrestriction digest or produced synthetically, recombinantly or by PCRamplification, that is capable of hybridizing to at least a portion ofanother oligonucleotide of interest. A probe may be single-stranded ordouble-stranded. Probes are useful in the detection, identification andisolation of particular gene sequences. It is contemplated that anyprobe used in the present invention will be labeled with any “reportermolecule,” so that is detectable in any detection system, including, butnot limited to enzyme (e.g., ELISA, as well as enzyme-basedhistochemical assays), fluorescent, radioactive, and luminescentsystems. It is not intended that the present invention be limited to anyparticular detection system or label.

As used herein the term “portion” when in reference to a nucleotidesequence (as in “a portion of a given nucleotide sequence”) refers tofragments of that sequence. The fragments may range in size from fournucleotides to the entire nucleotide sequence minus one nucleotide (10nucleotides, 20, 30, 40, 50, 100, 200, etc.).

As used herein, the terms “restriction endonucleases” and “restrictionenzymes” refer to bacterial enzymes, each of which cut double-strandedDNA at or near a specific nucleotide sequence.

The terms “in operable combination,” “in operable order,” and “operablylinked” as used herein refer to the linkage of nucleic acid sequences insuch a manner that a nucleic acid molecule capable of directing thetranscription of a given gene and/or the synthesis of a desired proteinmolecule is produced. The term also refers to the linkage of amino acidsequences in such a manner so that a functional protein is produced.

The term “isolated” when used in relation to a nucleic acid, as in “anisolated oligonucleotide” or “isolated polynucleotide” refers to anucleic acid sequence that is identified and separated from at least onecomponent or contaminant with which it is ordinarily associated in itsnatural source. Isolated nucleic acid is such present in a form orsetting that is different from that in which it is found in nature. Incontrast, non-isolated nucleic acids as nucleic acids such as DNA andRNA found in the state they exist in nature. For example, a given DNAsequence (e.g., a gene) is found on the host cell chromosome inproximity to neighboring genes; RNA sequences, such as a specific mRNAsequence encoding a specific protein, are found in the cell as a mixturewith numerous other mRNAs that encode a multitude of proteins. However,isolated nucleic acid encoding a given protein includes, by way ofexample, such nucleic acid in cells ordinarily expressing the givenprotein where the nucleic acid is in a chromosomal location differentfrom that of natural cells, or is otherwise flanked by a differentnucleic acid sequence than that found in nature. The isolated nucleicacid, oligonucleotide, or polynucleotide may be present insingle-stranded or double-stranded form. When an isolated nucleic acid,oligonucleotide or polynucleotide is to be utilized to express aprotein, the oligonucleotide or polynucleotide will contain at a minimumthe sense or coding strand (i.e., the oligonucleotide or polynucleotidemay be single-stranded), but may contain both the sense and anti-sensestrands (i.e., the oligonucleotide or polynucleotide may bedouble-stranded).

As used herein, the term “purified” or “to purify” refers to the removalof components (e.g., contaminants) from a sample. For example,antibodies are purified by removal of contaminating non-immunoglobulinproteins; they are also purified by the removal of immunoglobulin thatdoes not bind to the target molecule. The removal of non-immunoglobulinproteins and/or the removal of immunoglobulins that do not bind to thetarget molecule results in an increase in the percent of target-reactiveimmunoglobulins in the sample. In another example, recombinantpolypeptides are expressed in bacterial host cells and the polypeptidesare purified by the removal of host cell proteins; the percent ofrecombinant polypeptides is thereby increased in the sample.

“Amino acid sequence” and terms such as “polypeptide” or “protein” arenot meant to limit the amino acid sequence to the complete, native aminoacid sequence associated with the recited protein molecule.

The term “native protein” as used herein to indicate that a protein doesnot contain amino acid residues encoded by vector sequences; that is,the native protein contains only those amino acids found in the proteinas it occurs in nature. A native protein may be produced by recombinantmeans or may be isolated from a naturally occurring source.

As used herein the term “portion” when in reference to a protein (as in“a portion of a given protein”) refers to fragments of that protein. Thefragments may range in size from four amino acid residues to the entireamino acid sequence minus one amino acid.

The term “Southern blot,” refers to the analysis of DNA on agarose oracrylamide gels to fractionate the DNA according to size followed bytransfer of the DNA from the gel to a solid support, such asnitrocellulose or a nylon membrane. The immobilized DNA is then probedwith a labeled probe to detect DNA species complementary to the probeused. The DNA may be cleaved with restriction enzymes prior toelectrophoresis. Following electrophoresis, the DNA may be partiallydepurinated and denatured prior to or during transfer to the solidsupport. Southern blots are a standard tool of molecular biologists (J.Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold SpringHarbor Press, NY, pp 9.31-9.58 [1989]).

The term “Northern blot,” as used herein refers to the analysis of RNAby electrophoresis of RNA on agarose gels to fractionate the RNAaccording to size followed by transfer of the RNA from the gel to asolid support, such as nitrocellulose or a nylon membrane. Theimmobilized RNA is then probed with a labeled probe to detect RNAspecies complementary to the probe used. Northern blots are a standardtool of molecular biologists (J. Sambrook, et al., supra, pp 7.39-7.52[1989]).

The term “Western blot” refers to the analysis of protein(s) (orpolypeptides) immobilized onto a support such as nitrocellulose or amembrane. The proteins are run on acrylamide gels to separate theproteins, followed by transfer of the protein from the gel to a solidsupport, such as nitrocellulose or a nylon membrane. The immobilizedproteins are then exposed to antibodies with reactivity against anantigen of interest. The binding of the antibodies may be detected byvarious methods, including the use of radiolabeled antibodies.

The term “transgene” as used herein refers to a foreign gene that isplaced into an organism by, for example, introducing the foreign geneinto newly fertilized eggs or early embryos. The term “foreign gene”refers to any nucleic acid (e.g., gene sequence) that is introduced intothe genome of an animal by experimental manipulations and may includegene sequences found in that animal so long as the introduced gene doesnot reside in the same location as does the naturally occurring gene.

As used herein, the term “vector” is used in reference to nucleic acidmolecules that transfer DNA segment(s) from one cell to another. Theterm “vehicle” is sometimes used interchangeably with “vector.” Vectorsare often derived from plasmids, bacteriophages, or plant or animalviruses.

The term “expression vector” as used herein refers to a recombinant DNAmolecule containing a desired coding sequence and appropriate nucleicacid sequences necessary for the expression of the operably linkedcoding sequence in a particular host organism. Nucleic acid sequencesnecessary for expression in prokaryotes usually include a promoter, anoperator (optional), and a ribosome binding site, often along with othersequences. Eukaryotic cells are known to utilize promoters, enhancers,and termination and polyadenylation signals.

The terms “overexpression” and “overexpressing” and grammaticalequivalents, are used in reference to levels of mRNA to indicate a levelof expression approximately 3-fold higher (or greater) than thatobserved in a given tissue in a control or non-transgenic animal. Levelsof mRNA are measured using any of a number of techniques known to thoseskilled in the art including, but not limited to Northern blot analysis.Appropriate controls are included on the Northern blot to control fordifferences in the amount of RNA loaded from each tissue analyzed (e.g.,the amount of 28S rRNA, an abundant RNA transcript present atessentially the same amount in all tissues, present in each sample canbe used as a means of normalizing or standardizing the mRNA-specificsignal observed on Northern blots). The amount of mRNA present in theband corresponding in size to the correctly spliced transgene RNA isquantified; other minor species of RNA which hybridize to the transgeneprobe are not considered in the quantification of the expression of thetransgenic mRNA.

The term “transfection” as used herein refers to the introduction offoreign DNA into eukaryotic cells. Transfection may be accomplished by avariety of means known to the art including calcium phosphate-DNAco-precipitation, DEAE-dextran-mediated transfection, polybrene-mediatedtransfection, electroporation, microinjection, liposome fusion,lipofection, protoplast fusion, retroviral infection, and biolistics.

The term “calcium phosphate co-precipitation” refers to a technique forthe introduction of nucleic acids into a cell. The uptake of nucleicacids by cells is enhanced when the nucleic acid is presented as acalcium phosphate-nucleic acid co-precipitate. The original technique ofGraham and van der Eb (Graham and van der Eb, Virol., 52:456 [1973]),has been modified by several groups to optimize conditions forparticular types of cells. The art is well aware of these numerousmodifications.

The term “stable transfection” or “stably transfected” refers to theintroduction and integration of foreign DNA into the genome of thetransfected cell. The term “stable transfectant” refers to a cell thathas stably integrated foreign DNA into the genomic DNA.

The term “transient transfection” or “transiently transfected” refers tothe introduction of foreign DNA into a cell where the foreign DNA failsto integrate into the genome of the transfected cell. The foreign DNApersists in the nucleus of the transfected cell for several days. Duringthis time the foreign DNA is subject to the regulatory controls thatgovern the expression of endogenous genes in the chromosomes. The term“transient transfectant” refers to cells that have taken up foreign DNAbut have failed to integrate this DNA.

As used herein, the term “selectable marker” refers to the use of a genethat encodes an enzymatic activity that confers the ability to grow inmedium lacking what would otherwise be an essential nutrient (e.g. theHIS3 gene in yeast cells); in addition, a selectable marker may conferresistance to an antibiotic or drug upon the cell in which theselectable marker is expressed. Selectable markers may be “dominant”; adominant selectable marker encodes an enzymatic activity that can bedetected in any eukaryotic cell line. Examples of dominant selectablemarkers include the bacterial aminoglycoside 3′ phosphotransferase gene(also referred to as the neo gene) that confers resistance to the drugG418 in mammalian cells, the bacterial hygromycin G phosphotransferase(hyg) gene that confers resistance to the antibiotic hygromycin and thebacterial xanthine-guanine phosphoribosyl transferase gene (alsoreferred to as the gpt gene) that confers the ability to grow in thepresence of mycophenolic acid. Other selectable markers are not dominantin that their use must be in conjunction with a cell line that lacks therelevant enzyme activity. Examples of non-dominant selectable markersinclude the thymidine kinase (tk) gene that is used in conjunction withtk⁻ cell lines, the CAD gene that is used in conjunction withCAD-deficient cells and the mammalian hypoxanthine-guaninephosphoribosyl transferase (hprt) gene that is used in conjunction withhprt⁻ cell lines. A review of the use of selectable markers in mammaliancell lines is provided in Sambrook, J. et al., Molecular Cloning: ALaboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, NewYork (1989) pp. 16.9-16.15.

As used herein, the term “cell culture” refers to any in vitro cultureof cells. Included within this term are continuous cell lines (e.g.,with an immortal phenotype), primary cell cultures, transformed celllines, finite cell lines (e.g., non-transformed cells), and any othercell population maintained in vitro.

As used, the term “eukaryote” refers to organisms distinguishable from“prokaryotes.” It is intended that the term encompass all organisms withcells that exhibit the usual characteristics of eukaryotes, such as thepresence of a true nucleus bounded by a nuclear membrane, within whichlie the chromosomes, the presence of membrane-bound organelles, andother characteristics commonly observed in eukaryotic organisms. Thus,the term includes, but is not limited to such organisms as fungi,protozoa, and animals (e.g., humans).

As used herein, the term “in vitro” refers to an artificial environmentand to processes or reactions that occur within an artificialenvironment. In vitro environments can consist of, but are not limitedto, test tubes and cell culture. The term “in vivo” refers to thenatural environment (e.g., an animal or a cell) and to processes orreaction that occur within a natural environment.

The terms “test compound” and “candidate compound” refer to any chemicalentity, pharmaceutical, drug, and the like that is a candidate for useto treat or prevent a disease, illness, sickness, or disorder of bodilyfunction (e.g., cancer). Test compounds comprise both known andpotential therapeutic compounds. A test compound can be determined to betherapeutic by screening using the screening methods of the presentinvention. In some embodiments of the present invention, test compoundsinclude antisense compounds.

As used herein, the term “sample” is used in its broadest sense. In onesense, it is meant to include a specimen or culture obtained from anysource, as well as biological and environmental samples. Biologicalsamples may be obtained from animals (including humans) and encompassfluids, solids, tissues, and gases. Biological samples include bloodproducts, such as plasma, serum and the like. Environmental samplesinclude environmental material such as surface matter, soil, water,crystals and industrial samples. Such examples are not however to beconstrued as limiting the sample types applicable to the presentinvention.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to compositions and methods for diseasediagnostics. In particular, the present invention provides methods andcompositions for phage microarray profiling of cancer (e.g., prostate,lung or breast cancer). The present invention further provides novelmarkers useful for the diagnosis, characterization, and treatment ofdisease (e.g., cancers). The below description illustrates the presentinvention in the context of cancer diagnosis and treatment. However, thepresent invention is not limited to use in the diagnosis and treatmentof cancer. The methods and compositions of the present invention finduse in the diagnosis and treatment of a variety of diseases including,but not limited to, inflammatory disease, autoimmune disease, cancer,cardiovascular disease, and diabetes.

When cancer is identified at the earliest stages, the probability ofcure is very high and therefore diagnostic screening tests that candetect these early stages are crucial. Tumor-associated antigensrecognized by humoral effectors of the immune system are an attractivetarget for diagnostic and therapeutic approaches to human cancer.Efforts toward the development of early detection assays for cancershave traditionally depended on single biomarker molecule. Currenttechnologies have been disappointing and have not resulted in diagnostictests suitable for clinical practice.

Serologic identification of antigens by recombinant expression cloning(SEREX) has been used for identification of few types of antigen overrecent years through screening expression cDNA libraries from humansolid tumors with sera of the autologous patients. This type ofscreening of a cDNA expression library by conventional methods, however,requires the preparation of a large number of membrane filters blottedwith bacteriophage plaques that are then searched with a specific probe.In the case of the SEREX experiments, the screening is performed usinglarge amounts of sera from cancer patients, which are usually availablein very limited quantity. The second limitation is that suchimmunoscreening procedure does not allow selection of antigens that arerecognized by sera from different patients. In addition, due to thefilter screening procedure, SEREX does not allow for high throughputscreening and thus makes it difficult to perform replicated experimentsfor the selection of antigens that can be recognized by sera from asubset of cancer patients. Furthermore, SEREX relies upon a one-stepscreening technique without affinity selection steps (biopanning).

The methods and compositions of the present invention overcome many ofthese limitations. In some embodiments, the present invention providesan effective screening test to overcome these limitations and simplifythe screening procedure by performing affinity selection of cDNAlibraries in very small volumes using, for example, T7 phage displaycDNA libraries. The platform of phage-epitope microarrays is capable ofdetecting over 2300 phage clones in one microarray using onlymicroliters of sera. Highly parallel assays using different patientsamples are easily compared using protein microarray technology thatallows for the molecular classification of cancer based on epitomicprofiles (akin to molecular profiles based on gene expression). In someembodiments, the methods of the present invention employ the recognitionof a pattern of immunologic response as a diagnostic strategy. Thepresent invention is not limited by the nature of the peptide displaysystem used.

Phage-display technology is typically based on the insertion of foreignnucleotide sequences into genes encoding for various capsid proteins ofT7 phage, resulting in a heterogeneous mixture of phages, eachdisplaying the different peptide sequence encoded by a correspondinginsert. A physical link between a displayed fusion protein and DNAencoded for it make this phage target selectable. In some embodiments,the methods of the present invention detect antibodies that are producedby patients in reaction to proteins expressed in their tumors. Thesemarkers find use as diagnostic biomarkers and therapeutic targets. Insome embodiments, the methods of the present invention employ patternrecognition of multiple markers as a diagnostic rather than any singlemarker. Features of the approach include acknowledging the heterogeneousnature of any specific kind of cancer, and using specializedbioinformatics techniques to interpret the results.

Experiments conducted during the course of development of the presentinvention resulted in the detection of a serum reaction with largenumbers of epitopes using a highly parallel phage display assay onprotein microarrays. Once the chosen epitope markers are spotted on thefinal version of the array, serum from both cancer patients and controlsare tested. In some embodiments, the results of the reaction of the serawith the various subjects are used to train a machine learning device tobuild a predictor and further to test unknown samples.

The methods and compositions of the present invention provide severaladvantages over existing methods. For example, in some embodiments, themethods of the present invention utilize fluorescent probes and laserscanner, resulting in high sensitivity and the detection of very smallsignal differences. In addition, the methods of the present inventionallow for detection at the protein expression level rather than cDNAlevel as compared to cDNA or oligo arrays. In preferred embodiments, themethods of the present invention utilize an analytical approach ratherthat a visual assessment, which results in greater consistency andreproducibility. Further, due to the high sensitivity of this technique,low amounts (e.g., only 1-2 μl) of serum samples may be used. Themethods of the present invention are rapid and allow for the analysis ofprotein-protein interactions.

I. Markers for Cancer

In some embodiments, the present invention provides markers whoseexpression is specifically altered in cancerous prostate tissues. Suchmarkers find use in the diagnosis and characterization of cancer (e.g.,prostate, lung or breast cancer).

A. Identification of Markers

In some embodiments, the phage expression profiling methods of thepresent invention (See e.g., the experimental section for a detaileddescription) are used to identify cancer markers or tumor antigens.Exemplary prostate tumor antigens include, but are not limited to, BRD2,eIF4G1, RPL22, RPL13A, HES1, and hypothetical protein XP_(—)373908.Exemplary breast cancer tumor antigens include, but are not limited to,ubiquilin 1, nucleolar protein 3 (NOL3), alpha-2-glycoprotein 1 and heatshock 70 kDa protein 8 (HSPA70).

B. Detection of Cancer Markers

In some embodiments, the present invention provides methods fordetection of expression of cancer markers (e.g., BRD2, eIF4G1, RPL22,RPL13A, HES1, hypothetical protein XP_(—)373908, ubiquilin 1, nucleolarprotein 3 (NOL3), alpha-2-glycoprotein 1 and heat shock 70 kDa protein 8(HSPA70)). In preferred embodiments, expression is measured directly(e.g., at the RNA or protein level). In some embodiments, expression isdetected in tissue samples (e.g., biopsy tissue). In other embodiments,expression is detected in bodily fluids (e.g., including but not limitedto, plasma, serum, whole blood, mucus, and urine). The present inventionfurther provides panels and kits for the detection of markers. Inpreferred embodiments, the presence of a cancer marker is used toprovide a prognosis to a subject. The information provided is also usedto direct the course of treatment. For example, if a subject is found tohave a marker indicative of a highly metastasizing tumor, additionaltherapies (e.g., hormonal or radiation therapies) can be started at aearlier point when they are more likely to be effective (e.g., beforemetastasis). In addition, if a subject is found to have a tumor that isnot responsive to hormonal therapy, the expense and inconvenience ofsuch therapies can be avoided.

In some embodiments, the present invention provides a panel for theanalysis of a plurality of markers. The panel allows for thesimultaneous analysis of multiple markers correlating withcarcinogenesis and/or metastasis. For example, a panel may includemarkers identified as correlating with cancerous tissue, metastaticcancer, localized cancer that is likely to metastasize, pre-canceroustissue that is likely to become cancerous, and pre-cancerous tissue thatis not likely to become cancerous. Depending on the subject, panels maybe analyzed alone or in combination in order to provide the bestpossible diagnosis and prognosis. Markers for inclusion on a panel areselected by screening for their predictive value using any suitablemethod, including but not limited to, those described in theillustrative examples below.

In other embodiments, the present invention provides a phage arrayprofile map comprising protein array profiles of cancers of variousstages or prognoses (e.g., likelihood of future metastasis). Such mapscan be used for comparison with patient samples. Any suitable method maybe utilized, including but not limited to, by computer comparison ofdigitized data. The comparison data is used to provide diagnoses and/orprognoses to patients.

i) Detection of RNA

In some preferred embodiments, detection of prostate cancer markers(e.g., including but not limited to, BRD2, eIF4G1, RPL22, RPL13A, HES1,hypothetical protein XP_(—)373908, ubiquilin 1, nucleolar protein 3(NOL3), alpha-2-glycoprotein 1 and heat shock 70 kDa protein 8 (HSPA70))is detected by measuring the expression of corresponding mRNA in atissue sample (e.g., prostate, breast, or lung tissue). mRNA expressionmay be measured by any suitable method.

ii) Detection of Protein

In other embodiments, gene expression of cancer markers is detected bymeasuring the expression of the corresponding protein or polypeptide.Protein expression may be detected by any suitable method. In otherembodiments, proteins are detected by their binding to an antibodyraised against the protein. The generation of antibodies is describedbelow.

Antibody binding is detected by techniques known in the art (e.g.,radioimmunoassay, ELISA (enzyme-linked immunosorbant assay), “sandwich”immunoassays, immunoradiometric assays, gel diffusion precipitationreactions, immunodiffusion assays, in situ immunoassays (e.g., usingcolloidal gold, enzyme or radioisotope labels, for example), Westernblots, precipitation reactions, agglutination assays (e.g., gelagglutination assays, hemagglutination assays, etc.), complementfixation assays, immunofluorescence assays, protein A assays, andimmunoelectrophoresis assays, etc.

In one embodiment, antibody binding is detected by detecting a label onthe primary antibody. In another embodiment, the primary antibody isdetected by detecting binding of a secondary antibody or reagent to theprimary antibody. In a further embodiment, the secondary antibody islabeled. Many methods are known in the art for detecting binding in animmunoassay and are within the scope of the present invention.

In some embodiments, an automated detection assay is utilized. Methodsfor the automation of immunoassays include those described in U.S. Pat.Nos. 5,885,530, 4,981,785, 6,159,750, and 5,358,691, each of which isherein incorporated by reference. In some embodiments, the analysis andpresentation of results is also automated. For example, in someembodiments, software that generates a prognosis based on the presenceor absence of a series of proteins corresponding to cancer markers isutilized.

In other embodiments, the immunoassay described in U.S. Pat. Nos.5,599,677 and 5,672,480; each of which is herein incorporated byreference.

iii) Data Analysis

In some embodiments, a computer-based analysis program is used totranslate the raw data generated by the detection assay (e.g., thepresence, absence, or amount of a given marker or markers) into data ofpredictive value for a clinician. The clinician can access thepredictive data using any suitable means. Thus, in some preferredembodiments, the present invention provides the further benefit that theclinician, who is not likely to be trained in genetics or molecularbiology, need not understand the raw data. The data is presenteddirectly to the clinician in its most useful form. The clinician is thenable to immediately utilize the information in order to optimize thecare of the subject.

The present invention contemplates any method capable of receiving,processing, and transmitting the information to and from laboratoriesconducting the assays, information provides, medical personal, andsubjects. For example, in some embodiments of the present invention, asample (e.g., a biopsy or a serum or urine sample) is obtained from asubject and submitted to a profiling service (e.g., clinical lab at amedical facility, genomic profiling business, etc.), located in any partof the world (e.g., in a country different than the country where thesubject resides or where the information is ultimately used) to generateraw data. Where the sample comprises a tissue or other biologicalsample, the subject may visit a medical center to have the sampleobtained and sent to the profiling center, or subjects may collect thesample themselves (e.g., a urine sample) and directly send it to aprofiling center. Where the sample comprises previously determinedbiological information, the information may be directly sent to theprofiling service by the subject (e.g., an information card containingthe information may be scanned by a computer and the data transmitted toa computer of the profiling center using an electronic communicationsystems). Once received by the profiling service, the sample isprocessed and a profile is produced (i.e., expression data), specificfor the diagnostic or prognostic information desired for the subject.

The profile data is then prepared in a format suitable forinterpretation by a treating clinician. For example, rather thanproviding raw expression data, the prepared format may represent adiagnosis or risk assessment (e.g., likelihood of metastasis or PSAfailure) for the subject, along with recommendations for particulartreatment options. The data may be displayed to the clinician by anysuitable method. For example, in some embodiments, the profiling servicegenerates a report that can be printed for the clinician (e.g., at thepoint of care) or displayed to the clinician on a computer monitor.

In some embodiments, the information is first analyzed at the point ofcare or at a regional facility. The raw data is then sent to a centralprocessing facility for further analysis and/or to convert the raw datato information useful for a clinician or patient. The central processingfacility provides the advantage of privacy (all data is stored in acentral facility with uniform security protocols), speed, and uniformityof data analysis. The central processing facility can then control thefate of the data following treatment of the subject. For example, usingan electronic communication system, the central facility can providedata to the clinician, the subject, or researchers.

In some embodiments, the subject is able to directly access the datausing the electronic communication system. The subject may choosefurther intervention or counseling based on the results. In someembodiments, the data is used for research use. For example, the datamay be used to further optimize the inclusion or elimination of markersas useful indicators of a particular condition or stage of disease.

C. Detection of Tumor Antigens

As described above, the presence of an immune response to specificproteins expressed in cancerous cells is indicative of the presence ofcancer. Accordingly, in some embodiments, the present invention providesmethods (e.g., diagnostic methods) for detecting the presence of tumorantigens identified using the methods of the present invention (e.g.,BRD2, eIF4G1, RPL22, RPL13A, HES1, hypothetical protein XP_(—)373908,ubiquilin 1, nucleolar protein 3 (NOL3), alpha-2-glycoprotein 1 and heatshock 70 kDa protein 8 (HSPA70)). In some embodiments (e.g., where tumorantigens are expressed in cancerous cells but not non-cancerous cells),tumor antigen proteins are detected directly. In other embodiments(e.g., where the presence of an autoantibody in cancerous but notcancerous cells is indicative of the presence of cancer), autoantibodiesto the tumor antigens are detected. In preferred embodiments, tumorantigens are detected directly in tumors or cells suspected of beingcancerous.

The diagnostic methods of the present invention find utility in thediagnosis and characterization of cancers. For example, the presence ofan autoantibody to a specific protein may be indicative of a cancer. Inaddition, certain autoantibodies may be indicative of a specific stageor sub-type of the same cancer.

The information obtained is used to determine prognosis and appropriatecourse of treatment. For example, it is contemplated that individualswith a specific autoantibody or stage of cancer may respond differentlyto a given treatment than individuals lacking the antibody. Theinformation obtained from the diagnostic methods of the presentinvention thus provides for the personalization of diagnosis andtreatment.

i) Detection of Antigens

In some embodiments, antibodies are used to detect tumor antigens in abiological sample from an individual. The biological sample can be abiological fluid, such as, but not limited to, blood, serum, plasma,interstitial fluid, urine, cerebrospinal fluid, and the like, containingcells. In preferred embodiments, the biological sample comprises cellssuspected of being cancerous (e.g., cells obtained from a biopsy).

The biological samples can then be tested directly for the presence oftumor antigens using an appropriate strategy (e.g., ELISA orradioimmunoassay) and format (e.g., microwells, dipstick (e.g., asdescribed in International Patent Publication WO 93/03367), etc).Alternatively, proteins in the sample can be size separated (e.g., bypolyacrylamide gel electrophoresis (PAGE), in the presence or not ofsodium dodecyl sulfate (SDS), and the presence of tumor antigensdetected by immunoblotting (e.g., Western blotting). Immunoblottingtechniques are generally more effective with antibodies generatedagainst a peptide corresponding to an epitope of a protein, and hence,are particularly suited to the present invention.

Antibody binding is detected by techniques known in the art (e.g.,radioimmunoassay, ELISA (enzyme-linked immunosorbant assay), “sandwich”immunoassays, immunoradiometric assays, gel diffusion precipitationreactions, immunodiffusion assays, in situ immunoassays (e.g., usingcolloidal gold, enzyme or radioisotope labels, for example), Westernblots, precipitation reactions, agglutination assays (e.g., gelagglutination assays, hemagglutination assays, etc.), complementfixation assays, immunofluorescence assays, protein A assays, andimmunoelectrophoresis assays, etc.

In one embodiment, antibody binding is detected by detecting a label onthe primary antibody. In another embodiment, the primary antibody isdetected by detecting binding of a secondary antibody or reagent to theprimary antibody. In a further embodiment, the secondary antibody islabeled. Many means are known in the art for detecting binding in animmunoassay and are within the scope of the present invention. As iswell known in the art, the immunogenic peptide should be provided freeof the carrier molecule used in any immunization protocol. For example,if the peptide was conjugated to KLH, it may be conjugated to BSA, orused directly, in a screening assay.)

In some embodiments, an automated detection assay is utilized. Methodsfor the automation of immunoassays are well known in the art (See e.g.,U.S. Pat. Nos. 5,885,530, 4,981,785, 6,159,750, and 5,358,691, each ofwhich is herein incorporated by reference). In some embodiments, theanalysis and presentation of results is also automated. For example, insome embodiments, software that generates a prognosis based on thepresence or absence of a series of antigens is utilized.

ii) Detection of Autoantibodies

In some embodiments, the presence of autoantibodies to a tumor antigenis detected. This approach to diagnosing and typing tumors isparticularly suited to tumor antigens that are present, but notimmunogenic, in normal cells and immunogenic in tumor cells. Forexample, in some embodiments, antibodies (e.g., monoclonal orpolyclonal) are generated to the autoantibodies identified during thedevelopment of the present invention. Such antibodies are then used todetect the presence of autoantibodies using any suitable technique,including but not limited to, those described above.

In other embodiments, tumor proteins are attached to a solid surface.The presence of autoantibodies is identified by contacting the solidsurface (e.g., microarray) with serum from the subject and detectingbinding to a tumor marker. One exemplary method for performing such anassay is described in the experimental section below.

iii) Other Detection Methods

The present invention is not limited to the detection methods describedabove. Any suitable detection method that allows for the specificdetection of cancerous cells may be utilized. For example, in someembodiments, the expression of RNA corresponding to a tumor antigen geneis detected by hybridization to an antisense oligonucleotide (e.g.,those described below). In other embodiments, RNA expression is detectedby hybridization assays such as Northern blots, RNase assays, reversetranscriptase PCR amplification, and the like.

In further embodiments of the present invention, the presence ofparticular sequences in the genome of a subject are detected. Suchsequences include tumor antigen sequences associated with abnormalexpression of tumor antigens (e.g., overexpression or expression at aphysiological inappropriate time). These sequences includepolymorphisms, including polymorphisms in the transcribed sequence(e.g., that effect tumor antigen processing and/or translation) andregulatory sequences such as promoters, enhances, repressors, and thelike. These sequences may also include polymorphisms in genes or controlsequences associated with factors that affect expression such astranscription factors, and the like. Any suitable method for detectingand/or identifying these sequences is within the scope of the presentinvention including, but not limited to, nucleic acid sequencing,hybridization assays (e.g., Southern blotting), single nucleotidepolymorphism assays (See e.g., U.S. Pat. No. 5,994,069, hereinincorporated by reference in its entirety), and the like.

Direct and/or indirect measures of tumor antigen expression may be usedas a marker within the scope of the present invention. Because thepresent invention provides a link between tumor antigen expression andcancer, any indication of tumor expression may be used. For example, theexpression, activation, or repression of factors involved in tumorantigen signaling or regulation may be used as surrogate measures ofexpression, so long as they are reliably correlated with tumor antigenexpression and/or cancer.

D. Molecular Fingerprint

In some embodiments, the present invention provides “molecularfingerprints” or “expression profile maps” of cancer markers or tumorantigens. Such molecular fingerprints and expression profiles provide aprofile of the presence of autoantibodies or cancer markers inparticular cancers or cancer sub-types. The profiles find use inproviding cancer diagnoses and prognoses. Such prognoses can be used todetermine treatment course of action. For example, in some embodiments,the profile of a particular cancer subtype is indicative of a cancerthat is responsive to a particular choice of therapy. In otherembodiments, profiles are indicative of the aggressiveness of aparticular cancer sub-type and are used to determine the aggressivenessof treatment to be pursued.

E. Prognostic Applications

In some embodiments, cancer markers identified using the methods andcompositions of the present invention find use in providing cancerprognoses (e.g., probability of cancer metastasis, recurrence or deathfrom cancer). In experiments conducted during the course of developmentof the present invention (See e.g., Examples 3 and 4) a correlationbetween expression profiles and cancer prognosis was observed. Forexample, a correlation between expression of tripartite motif-containing7 isoform 4, cytochrome c oxidase subunit I, nucleolar protein 3(apoptosis repressor with CARD domain), hypothetical protein AM638,putative p150, MUP1, similar to CG9996-PA, hypothetical proteinMagn028940, COG0568: DNA-directed RNA polymerase, sigma subunit, IgGkappa light chain variable region and lung cancer prognosis was observed(See Example 3).

F. Kits

In yet other embodiments, the present invention provides kits for thedetection and characterization of cancer (e.g., prostate, breast, orlung cancer). In some embodiments, the kits contain antibodies specificfor a cancer marker or tumor antigen, in addition to detection reagentsand buffers. In other embodiments, the kits contain reagents specificfor the detection of mRNA or cDNA (e.g., oligonucleotide probes orprimers). In preferred embodiments, the kits contain all of thecomponents necessary to perform a detection assay, including allcontrols, directions for performing assays, and any necessary softwarefor analysis and presentation of results.

G. In Vivo Imaging

In some embodiments, in vivo imaging techniques are used to visualizethe expression of cancer markers or tumor antigens in an animal (e.g., ahuman or non-human mammal). For example, in some embodiments, cancermarker mRNA or protein is labeled using a labeled antibody specific forthe cancer marker. A specifically bound and labeled antibody can bedetected in an individual using an in vivo imaging method, including,but not limited to, radionuclide imaging, positron emission tomography,computerized axial tomography, X-ray or magnetic resonance imagingmethod, fluorescence detection, and chemiluminescent detection. Methodsfor generating antibodies to the cancer markers of the present inventionare described below.

The in vivo imaging methods of the present invention are useful in thediagnosis of cancers that express the cancer markers or tumor antigensof the present invention (e.g., prostate cancer). In vivo imaging isused to visualize the presence of a marker indicative of the cancer.Such techniques allow for diagnosis without the use of an unpleasantbiopsy. The in vivo imaging methods of the present invention are alsouseful for providing prognoses to cancer patients. For example, thepresence of a marker indicative of cancers likely to metastasize can bedetected. The in vivo imaging methods of the present invention canfurther be used to detect metastatic cancers in other parts of the body.

In some embodiments, reagents (e.g., antibodies) specific for the cancermarkers or tumor antigens of the present invention are fluorescentlylabeled. The labeled antibodies are introduced into a subject (e.g.,orally or parenterally). Fluorescently labeled antibodies are detectedusing any suitable method (e.g., using the apparatus described in U.S.Pat. No. 6,198,107, herein incorporated by reference).

In other embodiments, antibodies are radioactively labeled. The use ofantibodies for in vivo diagnosis is well known in the art. Sumerdon etal., (Nucl. Med. Biol 17:247-254 [1990] have described an optimizedantibody-chelator for the radioimmunoscintographic imaging of tumorsusing Indium-111 as the label. Griffin et al., (J Clin Onc 9:631-640[1991]) have described the use of this agent in detecting tumors inpatients suspected of having recurrent colorectal cancer. The use ofsimilar agents with paramagnetic ions as labels for magnetic resonanceimaging is known in the art (Lauffer, Magnetic Resonance in Medicine22:339-342 [1991]). The label used will depend on the imaging modalitychosen. Radioactive labels such as Indium-111, Technetium-99m, orIodine-131 can be used for planar scans or single photon emissioncomputed tomography (SPECT). Positron emitting labels such asFluorine-19 can also be used for positron emission tomography (PET). ForMRI, paramagnetic ions such as Gadolinium (III) or Manganese (II) can beused.

Radioactive metals with half-lives ranging from 1 hour to 3.5 days areavailable for conjugation to antibodies, such as scandium-47 (3.5 days)gallium-67 (2.8 days), gallium-68 (68 minutes), technetiium-99m (6hours), and indium-111 (3.2 days), of which gallium-67, technetium-99m,and indium-111 are preferable for gamma camera imaging, gallium-68 ispreferable for positron emission tomography.

A useful method of labeling antibodies with such radiometals is by meansof a bifunctional chelating agent, such as diethylenetriaminepentaaceticacid (DTPA), as described, for example, by Khaw et al. (Science 209:295[1980]) for In-111 and Tc-99m, and by Scheinberg et al. (Science215:1511 [1982]). Other chelating agents may also be used, but the1-(p-carboxymethoxybenzyl)EDTA and the carboxycarbonic anhydride of DTPAare advantageous because their use permits conjugation without affectingthe antibody's immunoreactivity substantially.

Another method for coupling DPTA to proteins is by use of the cyclicanhydride of DTPA, as described by Hnatowich et al. (Int. J. Appl.Radiat. Isot. 33:327 [1982]) for labeling of albumin with In-111, butwhich can be adapted for labeling of antibodies. A suitable method oflabeling antibodies with Tc-99m which does not use chelation with DPTAis the pretinning method of Crockford et al., (U.S. Pat. No. 4,323,546,herein incorporated by reference).

A preferred method of labeling immunoglobulins with Tc-99m is thatdescribed by Wong et al. (Int. J. Appl. Radiat. Isot., 29:251 [1978])for plasma protein, and recently applied successfully by Wong et al. (J.Nucl. Med., 23:229 [1981]) for labeling antibodies.

In the case of the radiometals conjugated to the specific antibody, itis likewise desirable to introduce as high a proportion of theradiolabel as possible into the antibody molecule without destroying itsimmunospecificity. A further improvement may be achieved by effectingradiolabeling in the presence of the specific cancer marker of thepresent invention, to insure that the antigen binding site on theantibody will be protected. The antigen is separated after labeling.

In still further embodiments, in vivo biophotonic imaging (Xenogen,Almeda, Calif.) is utilized for in vivo imaging. This real-time in vivoimaging utilizes luciferase. The luciferase gene is incorporated intocells, microorganisms, and animals (e.g., as a fusion protein with acancer marker of the present invention). When active, it leads to areaction that emits light. A CCD camera and software is used to capturethe image and analyze it.

II. Antibodies

The present invention provides isolated antibodies. In preferredembodiments, the present invention provides monoclonal antibodies thatspecifically bind to an isolated polypeptide comprised of at least fiveamino acid residues of the cancer markers or tumor antigens describedherein (e.g., BRD2, eIF4G1, RPL22, RPL13A, HES1, hypothetical proteinXP_(—)373908, ubiquilin 1, nucleolar protein 3 (NOL3),alpha-2-glycoprotein 1 and heat shock 70 kDa protein 8 (HSPA70)). Theseantibodies find use in the diagnostic and therapeutic methods describedherein.

An antibody against a protein of the present invention may be anymonoclonal or polyclonal antibody, as long as it can recognize theprotein. Antibodies can be produced by using a protein of the presentinvention as the antigen according to a conventional antibody orantiserum preparation process.

The present invention contemplates the use of both monoclonal andpolyclonal antibodies. Any suitable method may be used to generate theantibodies used in the methods and compositions of the presentinvention, including but not limited to, those disclosed herein. Forexample, for preparation of a monoclonal antibody, protein, as such, ortogether with a suitable carrier or diluent is administered to an animal(e.g., a mammal) under conditions that permit the production ofantibodies. For enhancing the antibody production capability, completeor incomplete Freund's adjuvant may be administered. Normally, theprotein is administered once every 2 weeks to 6 weeks, in total, about 2times to about 10 times. Animals suitable for use in such methodsinclude, but are not limited to, primates, rabbits, dogs, guinea pigs,mice, rats, sheep, goats, etc.

For preparing monoclonal antibody-producing cells, an individual animalwhose antibody titer has been confirmed (e.g., a mouse) is selected, and2 days to 5 days after the final immunization, its spleen or lymph nodeis harvested and antibody-producing cells contained therein are fusedwith myeloma cells to prepare the desired monoclonal antibody producerhybridoma. Measurement of the antibody titer in antiserum can be carriedout, for example, by reacting the labeled protein, as describedhereinafter and antiserum and then measuring the activity of thelabeling agent bound to the antibody. The cell fusion can be carried outaccording to known methods, for example, the method described by Koehlerand Milstein (Nature 256:495 [1975]). As a fusion promoter, for example,polyethylene glycol (PEG) or Sendai virus (HVJ), preferably PEG is used.

Examples of myeloma cells include NS-1, P3U1, SP2/0, AP-1 and the like.The proportion of the number of antibody producer cells (spleen cells)and the number of myeloma cells to be used is preferably about 1:1 toabout 20:1. PEG (preferably PEG 1000-PEG 6000) is preferably added inconcentration of about 10% to about 80%. Cell fusion can be carried outefficiently by incubating a mixture of both cells at about 20° C. toabout 40° C., preferably about 30° C. to about 37° C. for about 1 minuteto 10 minutes.

Various methods may be used for screening for a hybridoma producing theantibody (e.g., against a tumor antigen or autoantibody of the presentinvention). For example, where a supernatant of the hybridoma is addedto a solid phase (e.g., microplate) to which antibody is adsorbeddirectly or together with a carrier and then an anti-immunoglobulinantibody (if mouse cells are used in cell fusion, anti-mouseimmunoglobulin antibody is used) or Protein A labeled with a radioactivesubstance or an enzyme is added to detect the monoclonal antibodyagainst the protein bound to the solid phase. Alternately, a supernatantof the hybridoma is added to a solid phase to which ananti-immunoglobulin antibody or Protein A is adsorbed and then theprotein labeled with a radioactive substance or an enzyme is added todetect the monoclonal antibody against the protein bound to the solidphase.

Selection of the monoclonal antibody can be carried out according to anyknown method or its modification. Normally, a medium for animal cells towhich HAT (hypoxanthine, aminopterin, thymidine) are added is employed.Any selection and growth medium can be employed as long as the hybridomacan grow. For example, RPMI 1640 medium containing 1% to 20%, preferably10% to 20% fetal bovine serum, GIT medium containing 1% to 10% fetalbovine serum, a serum free medium for cultivation of a hybridoma(SFM-101, Nissui Seiyaku) and the like can be used. Normally, thecultivation is carried out at 20° C. to 40° C., preferably 37° C. forabout 5 days to 3 weeks, preferably 1 week to 2 weeks under about 5% CO₂gas. The antibody titer of the supernatant of a hybridoma culture can bemeasured according to the same manner as described above with respect tothe antibody titer of the anti-protein in the antiserum.

Separation and purification of a monoclonal antibody (e.g., against acancer marker of the present invention) can be carried out according tothe same manner as those of conventional polyclonal antibodies such asseparation and purification of immunoglobulins, for example,salting-out, alcoholic precipitation, isoelectric point precipitation,electrophoresis, adsorption and desorption with ion exchangers (e.g.,DEAE), ultracentrifugation, gel filtration, or a specific purificationmethod wherein only an antibody is collected with an active adsorbentsuch as an antigen-binding solid phase, Protein A or Protein G anddissociating the binding to obtain the antibody.

Polyclonal antibodies may be prepared by any known method ormodifications of these methods including obtaining antibodies frompatients. For example, a complex of an immunogen (an antigen against theprotein) and a carrier protein is prepared and an animal is immunized bythe complex according to the same manner as that described with respectto the above monoclonal antibody preparation. A material containing theantibody against is recovered from the immunized animal and the antibodyis separated and purified.

As to the complex of the immunogen and the carrier protein to be usedfor immunization of an animal, any carrier protein and any mixingproportion of the carrier and a hapten can be employed as long as anantibody against the hapten, which is cross-linked on the carrier andused for immunization, is produced efficiently. For example, bovineserum albumin, bovine cycloglobulin, keyhole limpet hemocyanin, etc. maybe coupled to a hapten in a weight ratio of about 0.1 part to about 20parts, preferably, about 1 part to about 5 parts per 1 part of thehapten.

In addition, various condensing agents can be used for coupling of ahapten and a carrier. For example, glutaraldehyde, carbodiimide,maleimide activated ester, activated ester reagents containing thiolgroup or dithiopyridyl group, and the like find use with the presentinvention. The condensation product as such or together with a suitablecarrier or diluent is administered to a site of an animal that permitsthe antibody production. For enhancing the antibody productioncapability, complete or incomplete Freund's adjuvant may beadministered. Normally, the protein is administered once every 2 weeksto 6 weeks, in total, about 3 times to about 10 times.

The polyclonal antibody is recovered from blood, ascites and the like,of an animal immunized by the above method. The antibody titer in theantiserum can be measured according to the same manner as that describedabove with respect to the supernatant of the hybridoma culture.Separation and purification of the antibody can be carried out accordingto the same separation and purification method of immunoglobulin as thatdescribed with respect to the above monoclonal antibody.

The protein used herein as the immunogen is not limited to anyparticular type of immunogen. For example, a cancer marker of thepresent invention (further including a gene having a nucleotide sequencepartly altered) can be used as the immunogen. Further, fragments of theprotein may be used. Fragments may be obtained by any methods including,but not limited to expressing a fragment of the gene, enzymaticprocessing of the protein, chemical synthesis, and the like.

III. Drug Screening

In some embodiments, the present invention provides drug screeningassays (e.g., to screen for anticancer drugs). The screening methods ofthe present invention utilize cancer markers and tumor antigensidentified using the methods of the present invention. For example, insome embodiments, the present invention provides methods of screeningfor compound that alter (e.g., increase or decrease) the expression ofcancer marker or tumor antigen genes. In some embodiments, candidatecompounds are antisense agents (e.g., oligonucleotides) directed againstcancer markers. See below for a discussion of antisense therapy. Inother embodiments, candidate compounds are antibodies that specificallybind to a cancer marker or tumor antigen of the present invention.

In one screening method, candidate compounds are evaluated for theirability to alter cancer marker expression by contacting a compound witha cell expressing a cancer marker and then assaying for the effect ofthe candidate compounds on expression. In some embodiments, the effectof candidate compounds on expression of a cancer marker gene is assayedfor by detecting the level of cancer marker or tumor antigen mRNAexpressed by the cell. mRNA expression can be detected by any suitablemethod. In other embodiments, the effect of candidate compounds onexpression of cancer marker or tumor antigen genes is assayed bymeasuring the level of polypeptide encoded by the cancer markers. Thelevel of polypeptide expressed can be measured using any suitablemethod, including but not limited to, those disclosed herein.

Specifically, the present invention provides screening methods foridentifying modulators, i.e., candidate or test compounds or agents(e.g., proteins, peptides, peptidomimetics, peptoids, small molecules orother drugs) which bind to cancer markers or tumor antigens of thepresent invention, have an inhibitory (or stimulatory) effect on, forexample, cancer marker or tumor antigen expression or activity, or havea stimulatory or inhibitory effect on, for example, the expression oractivity of a cancer marker or tumor antigen substrate. Compounds thusidentified can be used to modulate the activity of target gene products(e.g., cancer marker or tumor antigen genes) either directly orindirectly in a therapeutic protocol, to elaborate the biologicalfunction of the target gene product, or to identify compounds thatdisrupt normal target gene interactions. Compounds that inhibit theactivity or expression of cancer markers or tumor antigens are useful inthe treatment of proliferative disorders, e.g., cancer, particularlymetastatic (e.g., androgen independent) prostate cancer.

In one embodiment, the invention provides assays for screening candidateor test compounds that are substrates of a cancer marker or tumorantigen protein or polypeptide or a biologically active portion thereof.In another embodiment, the invention provides assays for screeningcandidate or test compounds that bind to or modulate the activity of acancer marker or tumor antigen protein or polypeptide or a biologicallyactive portion thereof.

The test compounds of the present invention can be obtained using any ofthe numerous approaches in combinatorial library methods known in theart, including biological libraries; peptoid libraries (libraries ofmolecules having the functionalities of peptides, but with a novel,non-peptide backbone, which are resistant to enzymatic degradation butwhich nevertheless remain bioactive; see, e.g., Zuckennann et al., J.Med. Chem. 37: 2678-85 [1994]); spatially addressable parallel solidphase or solution phase libraries; synthetic library methods requiringdeconvolution; the ‘one-bead one-compound’ library method; and syntheticlibrary methods using affinity chromatography selection. The biologicallibrary and peptoid library approaches are preferred for use withpeptide libraries, while the other four approaches are applicable topeptide, non-peptide oligomer or small molecule libraries of compounds(Lam (1997) Anticancer Drug Des. 12:145).

Examples of methods for the synthesis of molecular libraries can befound in the art, for example in: DeWitt et al., Proc. Natl. Acad. Sci.U.S.A. 90:6909 [1993]; Erb et al., Proc. Nalt. Acad. Sci. USA 91:11422[1994]; Zuckermann et al., J. Med. Chem. 37:2678 [1994]; Cho et al.,Science 261:1303 [1993]; Carrell et al., Angew. Chem. Int. Ed. Engl.33.2059 [1994]; Carell et al., Angew. Chem. Int. Ed. Engl. 33:2061[1994]; and Gallop et al., J. Med. Chem. 37:1233 [1994].

Libraries of compounds may be presented in solution (e.g., Houghten,Biotechniques 13:412-421 [1992]), or on beads (Lam, Nature 354:82-84[1991]), chips (Fodor, Nature 364:555-556 [1993]), bacteria or spores(U.S. Pat. No. 5,223,409; herein incorporated by reference), plasmids(Cull et al., Proc. Natl. Acad. Sci. USA 89:1865-1869 [1992]) or onphage (Scott and Smith, Science 249:386-390 [1990]; Devlin Science249:404-406 [1990]; Cwirla et al., Proc. Natl. Acad. Sci. 87:6378-6382[1990]; Felici, J. Mol. Biol. 222:301 [1991]).

In one embodiment, an assay is a cell-based assay in which a cell thatexpresses a cancer marker or tumor antigen protein or biologicallyactive portion thereof is contacted with a test compound, and theability of the test compound to the modulate cancer marker's activity isdetermined. Determining the ability of the test compound to modulatecancer marker activity can be accomplished by monitoring, for example,changes in enzymatic activity. The cell, for example, can be ofmammalian origin.

The ability of the test compound to modulate cancer marker or tumorantigen binding to a compound, e.g., a cancer marker substrate, can alsobe evaluated. This can be accomplished, for example, by coupling thecompound, e.g., the substrate, with a radioisotope or enzymatic labelsuch that binding of the compound, e.g., the substrate, to a cancermarker can be determined by detecting the labeled compound, e.g.,substrate, in a complex.

Alternatively, the cancer marker or tumor antigen is coupled with aradioisotope or enzymatic label to monitor the ability of a testcompound to modulate cancer marker binding to a cancer marker or tumorantigen substrate in a complex. For example, compounds (e.g.,substrates) can be labeled with ¹²⁵I, ³⁵S ¹⁴C or ³H, either directly orindirectly, and the radioisotope detected by direct counting ofradioemmission or by scintillation counting. Alternatively, compoundscan be enzymatically labeled with, for example, horseradish peroxidase,alkaline phosphatase, or luciferase, and the enzymatic label detected bydetermination of conversion of an appropriate substrate to product.

The ability of a compound (e.g., a cancer marker substrate) to interactwith a cancer marker with or without the labeling of any of theinteractants can be evaluated. For example, a microphysiometer can beused to detect the interaction of a compound with a cancer markerwithout the labeling of either the compound or the cancer marker(McConnell et al. Science 257:1906-1912 [1992]). As used herein, a“microphysiometer” (e.g., Cytosensor) is an analytical instrument thatmeasures the rate at which a cell acidifies its environment using alight-addressable potentiometric sensor (LAPS). Changes in thisacidification rate can be used as an indicator of the interactionbetween a compound and cancer markers.

In yet another embodiment, a cell-free assay is provided in which acancer marker or tumor antigen protein or biologically active portionthereof is contacted with a test compound and the ability of the testcompound to bind to the cancer marker or tumor antigen protein orbiologically active portion thereof is evaluated. Preferred biologicallyactive portions of the cancer marker or tumor antigen proteins to beused in assays of the present invention include fragments thatparticipate in interactions with substrates or other proteins, e.g.,fragments with high surface probability scores.

Cell-free assays involve preparing a reaction mixture of the target geneprotein and the test compound under conditions and for a time sufficientto allow the two components to interact and bind, thus forming a complexthat can be removed and/or detected.

The interaction between two molecules can also be detected, e.g., usingfluorescence energy transfer (FRET) (see, for example, Lakowicz et al.,U.S. Pat. No. 5,631,169; Stavrianopoulos et al., U.S. Pat. No.4,968,103; each of which is herein incorporated by reference). Afluorophore label is selected such that a first donor molecule's emittedfluorescent energy will be absorbed by a fluorescent label on a second,‘acceptor’ molecule, which in turn is able to fluoresce due to theabsorbed energy.

Alternately, the ‘donor’ protein molecule may simply utilize the naturalfluorescent energy of tryptophan residues. Labels are chosen that emitdifferent wavelengths of light, such that the ‘acceptor’ molecule labelmay be differentiated from that of the ‘donor’. Since the efficiency ofenergy transfer between the labels is related to the distance separatingthe molecules, the spatial relationship between the molecules can beassessed. In a situation in which binding occurs between the molecules,the fluorescent emission of the ‘acceptor’ molecule label in 1 5 theassay should be maximal. An FRET binding event can be convenientlymeasured through standard fluorometric detection means well known in theart (e.g., using a fluorimeter).

In another embodiment, determining the ability of the cancer marker ortumor antigen protein to bind to a target molecule can be accomplishedusing real-time Biomolecular Interaction Analysis (BIA) (see, e.g.,Sjolander and Urbaniczky, Anal. Chem. 63:2338-2345 [1991] and Szabo etal. Curr. Opin. Struct. Biol. 5:699-705 [1995]). “Surface plasmonresonance” or “BIA” detects biospecific interactions in real time,without labeling any of the interactants (e.g., BIAcore). Changes in themass at the binding surface (indicative of a binding event) result inalterations of the refractive index of light near the surface (theoptical phenomenon of surface plasmon resonance (SPR)), resulting in adetectable signal that can be used as an indication of real-timereactions between biological molecules.

In one embodiment, the target gene product or the test substance isanchored onto a solid phase. The target gene product/test compoundcomplexes anchored on the solid phase can be detected at the end of thereaction. Preferably, the target gene product can be anchored onto asolid surface, and the test compound, (which is not anchored), can belabeled, either directly or indirectly, with detectable labels discussedherein.

It may be desirable to immobilize cancer markers, an anti-cancer markerantibody or its target molecule to facilitate separation of complexedfrom non-complexed forms of one or both of the proteins, as well as toaccommodate automation of the assay. Binding of a test compound to acancer marker protein, or interaction of a cancer marker protein with atarget molecule in the presence and absence of a candidate compound, canbe accomplished in any vessel suitable for containing the reactants.Examples of such vessels include microtiter plates, test tubes, andmicro-centrifuge tubes. In one embodiment, a fusion protein can beprovided which adds a domain that allows one or both of the proteins tobe bound to a matrix. For example, glutathione-S-transferase-cancermarker fusion proteins or glutathione-S-transferase/target fusionproteins can be adsorbed onto glutathione Sepharose beads (SigmaChemical, St. Louis, Mo.) or glutathione-derivatized microtiter plates,which are then combined with the test compound or the test compound andeither the non-adsorbed target protein or cancer marker protein, and themixture incubated under conditions conducive for complex formation(e.g., at physiological conditions for salt and pH). Followingincubation, the beads or microtiter plate wells are washed to remove anyunbound components, the matrix immobilized in the case of beads, complexdetermined either directly or indirectly, for example, as describedabove.

Alternatively, the complexes can be dissociated from the matrix, and thelevel of cancer markers binding or activity determined using standardtechniques. Other techniques for immobilizing either cancer markersprotein or a target molecule on matrices include using conjugation ofbiotin and streptavidin. Biotinylated cancer marker protein or targetmolecules can be prepared from biotin-NHS(N-hydroxy-succinimide) usingtechniques known in the art (e.g., biotinylation kit, Pierce Chemicals,Rockford, EL), and immobilized in the wells of streptavidin-coated 96well plates (Pierce Chemical).

In order to conduct the assay, the non-immobilized component is added tothe coated surface containing the anchored component. After the reactionis complete, unreacted components are removed (e.g., by washing) underconditions such that any complexes formed will remain immobilized on thesolid surface. The detection of complexes anchored on the solid surfacecan be accomplished in a number of ways. Where the previouslynon-immobilized component is pre-labeled, the detection of labelimmobilized on the surface indicates that complexes were formed. Wherethe previously non-immobilized component is not pre-labeled, an indirectlabel can be used to detect complexes anchored on the surface; e.g.,using a labeled antibody specific for the immobilized component (theantibody, in turn, can be directly labeled or indirectly labeled with,e.g., a labeled anti-IgG antibody).

This assay is performed utilizing antibodies reactive with cancer markeror tumor antigen protein or target molecules but which do not interferewith binding of the cancer markers protein to its target molecule. Suchantibodies can be derivatized to the wells of the plate, and unboundtarget or cancer markers protein trapped in the wells by antibodyconjugation. Methods for detecting such complexes, in addition to thosedescribed above for the GST-immobilized complexes, includeimmunodetection of complexes using antibodies reactive with the cancermarker protein or target molecule, as well as enzyme-linked assays whichrely on detecting an enzymatic activity associated with the cancermarker or tumor antigen protein or target molecule.

Alternatively, cell free assays can be conducted in a liquid phase. Insuch an assay, the reaction products are separated from unreactedcomponents, by any of a number of standard techniques, including, butnot limited to: differential centrifugation (see, for example, Rivas andMinton, Trends Biochem Sci 18:284-7 [1993]); chromatography (gelfiltration chromatography, ion-exchange chromatography); electrophoresis(see, e.g., Ausubel et al., eds. Current Protocols in Molecular Biology1999, J. Wiley: New York.); and immunoprecipitation (see, for example,Ausubel et al., eds. Current Protocols in Molecular Biology 1999, J.Wiley: New York). Such resins and chromatographic techniques are knownto one skilled in the art (See e.g., Heegaard J. Mol. Recognit 11:141-8[1998]; Hageand Tweed J. Chromatogr. Biomed. Sci. Appl 699:499-525[1997]). Further, fluorescence energy transfer may also be convenientlyutilized, as described herein, to detect binding without furtherpurification of the complex from solution.

The assay can include contacting the cancer marker or tumor antigenprotein or biologically active portion thereof with a known compoundthat binds the cancer marker or tumor antigen to form an assay mixture,contacting the assay mixture with a test compound, and determining theability of the test compound to interact with a cancer marker or tumorantigen protein, wherein determining the ability of the test compound tointeract with a cancer marker or tumor antigen protein includesdetermining the ability of the test compound to preferentially bind tocancer markers or tumor antigens or biologically active portion thereof,or to modulate the activity of a target molecule, as compared to theknown compound.

To the extent that cancer markers can, in vivo, interact with one ormore cellular or extracellular macromolecules, such as proteins,inhibitors of such an interaction are useful. A homogeneous assay can beused can be used to identify inhibitors.

For example, a preformed complex of the target gene product and theinteractive cellular or extracellular binding partner product isprepared such that either the target gene products or their bindingpartners are labeled, but the signal generated by the label is quencheddue to complex formation (see, e.g., U.S. Pat. No. 4,109,496, hereinincorporated by reference, that utilizes this approach forimmunoassays). The addition of a test substance that competes with anddisplaces one of the species from the preformed complex will result inthe generation of a signal above background. In this way, testsubstances that disrupt target gene product-binding partner interactioncan be identified. Alternatively, cancer markers protein can be used asa “bait protein” in a two-hybrid assay or three-hybrid assay (see, e.g.,U.S. Pat. No. 5,283,317; Zervos et al., Cell 72:223-232 [1993]; Maduraet al., J. Biol. Chem. 268.12046-12054 [1993]; Bartel et al.,Biotechniques 14:920-924 [1993]; Iwabuchi et al., Oncogene 8:1693-1696[1993]; and Brent WO 94/10300; each of which is herein incorporated byreference), to identify other proteins, that bind to or interact withcancer markers or tumor antigens (“cancer marker-binding proteins” or“cancer marker-bp”) and are involved in cancer marker or tumor antigenactivity. Such cancer marker-bps can be activators or inhibitors ofsignals by the cancer marker proteins or targets as, for example,downstream elements of a cancer markers-mediated signaling pathway.

Modulators of cancer marker or tumor antigen expression can also beidentified. For example, a cell or cell free mixture is contacted with acandidate compound and the expression of cancer marker or tumor antigenmRNA or protein evaluated relative to the level of expression of cancermarker or tumor antigen mRNA or protein in the absence of the candidatecompound. When expression of cancer marker or tumor antigen mRNA orprotein is greater in the presence of the candidate compound than in itsabsence, the candidate compound is identified as a stimulator of cancermarker or tumor antigen mRNA or protein expression. Alternatively, whenexpression of cancer marker or tumor antigen mRNA or protein is less(i.e., statistically significantly less) in the presence of thecandidate compound than in its absence, the candidate compound isidentified as an inhibitor of cancer marker or tumor antigen mRNA orprotein expression. The level of cancer marker or tumor antigen mRNA orprotein expression can be determined by methods described herein fordetecting cancer marker or tumor antigen mRNA or protein.

A modulating agent can be identified using a cell-based or a cell freeassay, and the ability of the agent to modulate the activity of a cancermarker or tumor antigen protein can be confirmed in vivo, e.g., in ananimal such as an animal model for a disease (e.g., an animal withprostate, breast or lung cancer or metastatic prostate, breast, or lungcancer; or an animal harboring a xenograft of a prostate, lung, orbreast cancer from an animal (e.g., human) or cells from a cancerresulting from metastasis of a prostate, breast, or lung cancer (e.g.,to a lymph node, bone, or liver), or cells from a prostate, breast, orlung cancer cell line.

This invention further pertains to novel agents identified by theabove-described screening assays (See e.g., below description of cancertherapies). Accordingly, it is within the scope of this invention tofurther use an agent identified as described herein (e.g., a cancermarker modulating agent, an antisense cancer marker nucleic acidmolecule, a siRNA molecule, a cancer marker specific antibody, or acancer marker-binding partner) in an appropriate animal model (such asthose described herein) to determine the efficacy, toxicity, sideeffects, or mechanism of action, of treatment with such an agent.Furthermore, novel agents identified by the above-described screeningassays can be, e.g., used for treatments as described herein.

IV. Cancer Therapies

In some embodiments, the present invention provides therapies for cancer(e.g., prostate cancer). In some embodiments, therapies target cancermarkers or tumor antigens identified using the phage array profilingmethods of the present invention (e.g., BRD2, eIF4G1, RPL22, RPL13A,HES1, hypothetical protein XP_(—)373908, ubiquilin 1, nucleolar protein3 (NOL3), alpha-2-glycoprotein 1 and heat shock 70 kDa protein 8(HSPA70)).

A. Immunotherapy

The tumor antigens identified during the development of the presentinvention find use in cancer immunotherapy. Such methods areimprovements over the non-specific chemotherapeutic cancer therapiescurrently available. For example, in some embodiments, tumor antigensare used to generate therapeutic antibodies. In other embodiments, thetumor antigens of the present invention find use in the generation ofcancer vaccines.

i) Antibody Immunotherapy

In some embodiments, the present invention provides therapy for cancercomprising the administration of therapeutic antibodies (See e.g., U.S.Pat. Nos. 6,180,357; and 6,051,230; both of which are hereinincorporated by reference).

In some embodiments, the therapeutic antibodies comprise an antibodygenerated against a tumor antigen of the present invention (e.g., BRD2,eIF4G1, RPL22, RPL13A, HES 1, hypothetical protein XP_(—)373908,ubiquilin 1, nucleolar protein 3 (NOL3), alpha-2-glycoprotein 1 and heatshock 70 kDa protein 8 (HSPA70)) conjugated to a cytotoxic agent. Suchantibodies are particularly suited for targeting tumor antigensexpressed on tumor cells but not normal cells. In such embodiments, atumor specific therapeutic agent is generated that does not targetnormal cells, thus reducing many of the detrimental side effects oftraditional chemotherapy. For certain applications, it is envisionedthat the therapeutic agents will be pharmacologic agents will serve asuseful agents for attachment to antibodies or growth factors,particularly cytotoxic or otherwise anticellular agents having theability to kill or suppress the growth or cell division of endothelialcells. The present invention contemplates the use of any pharmacologicagent that can be conjugated to an antibody, and delivered in activeform. Exemplary anticellular agents include chemotherapeutic agents,radioisotopes, and cytotoxins. The therapeutic antibodies of the presentinvention may include a variety of cytotoxic moieties, including but notlimited to, radioactive isotopes (e.g., iodine-131, iodine-123,technicium-99m, indium-111, rhenium-188, rhenium-186, gallium-67,copper-67, yttrium-90, iodine-125 or astatine-211), hormones such as asteroid, antimetabolites such as cytosines (e.g., arabinoside,fluorouracil, methotrexate or aminopterin; an anthracycline; mitomycinC), vinca alkaloids (e.g., demecolcine; etoposide; mithramycin), andantitumor alkylating agent such as chlorambucil or melphalan. Otherembodiments may include agents such as a coagulant, a cytokine, growthfactor, bacterial endotoxin or the lipid A moiety of bacterialendotoxin. For example, in some embodiments, therapeutic agents willinclude plant-, fungus- or bacteria-derived toxin, such as an A chaintoxins, a ribosome inactivating protein, α-sarcin, aspergillin,restrictocin, a ribonuclease, diphtheria toxin or pseudomonas exotoxin,to mention just a few examples. In some preferred embodiments,deglycosylated ricin A chain is utilized.

In any event, it is proposed that agents such as these may, if desired,be successfully conjugated to an antibody, in a manner that will allowtheir targeting, internalization, release or presentation to bloodcomponents at the site of the targeted tumor cells as required usingknown conjugation technology (See, e.g., Ghose et al., Methods Enzymol.,93:280 [1983]).

For example, in some embodiments the present invention providesimmunotoxins targeted to tumor antigens of the present invention.Immunotoxins are conjugates of a specific targeting agent typically atumor-directed antibody or fragment, with a cytotoxic agent, such as atoxin moiety. The targeting agent directs the toxin to, and therebyselectively kills, cells carrying the targeted antigen. In someembodiments, therapeutic antibodies employ crosslinkers that providehigh in vivo stability (Thorpe et al., Cancer Res., 48:6396 [1988]).

In other embodiments, particularly those involving treatment of solidtumors, antibodies are designed to have a cytotoxic or otherwiseanticellular effect against the tumor vasculature, by suppressing thegrowth or cell division of the vascular endothelial cells. This attackis intended to lead to a tumor-localized vascular collapse, deprivingthe tumor cells, particularly those tumor cells distal of thevasculature, of oxygen and nutrients, ultimately leading to cell deathand tumor necrosis.

In preferred embodiments, antibody based therapeutics are formulated aspharmaceutical compositions and described above. In preferredembodiments, administration of an antibody composition of the presentinvention results in a measurable decrease in cancer (e.g., decrease orelimination of tumor).

ii) Cancer Vaccines

In some embodiments, the present invention provides cancer vaccinesdirected against a specific cancer. Cancer vaccines induce a systemictumor-specific immune response. Such a response is capable oferadicating tumor cells anywhere in the body (e.g., metastatic tumorcells). Methods for generating tumor vaccines are well known in the art(See e.g., U.S. Pat. Nos. 5,994,523; 5,972,334; 5,904,920; 5,674,486;and 6,207,147; each of which is herein incorporated by reference).

In some embodiments, tumor vaccines are administered when cancer isfirst detected (e.g., concurrently with other therapeutics such aschemotherapy). In other embodiments, cancer vaccines are administeredfollowing treatment (e.g., surgical resection or chemotherapy) toprevent relapse or metastases. In yet other embodiments, cancer vaccinesare administered prophylactically (e.g., to those at risk of a certaincancer).

In some embodiments, the cancer vaccines of the present inventioncomprise one or more tumor antigens in a pharmaceutical composition(e.g., those described above). In some embodiments, the tumor antigen isinactivated prior to administration. In other embodiments, the vaccinefurther comprises one or more additional therapeutic agents (e.g.,cytokines or cytokine expressing cells).

In some embodiments (e.g., the method described in U.S. Pat. No.5,674,486, herein incorporated by reference), selected cells from apatient, such as fibroblasts, obtained, for example, from a routine skinbiopsy, are genetically modified to express one or more cytokines.Alternatively, patient cells that may normally serve as antigenpresenting cells in the immune system such as macrophages, monocytes,and lymphocytes may also be genetically modified to express one or morecytokines. The cytokine expressing cells are then mixed with thepatient's tumor antigens (e.g., a tumor antigen of the presentinvention), for example in the form of irradiated tumor cells, oralternatively in the form of purified natural or recombinant tumorantigen, and employed in immunizations, for example subcutaneously, toinduce systemic anti-tumor immunity.

The vaccines of the present invention may be administered using anysuitable method, including but not limited to, those described above. Inpreferred embodiments, administration of a cancer vaccine of the presentinvention results in elimination (e.g., decrease or elimination oftumors) or prevention of detectable cancer cells.

B. Antisense Therapies

In some embodiments, the present invention targets the expression ofcancer markers. For example, in some embodiments, the present inventionemploys compositions comprising oligomeric antisense compounds,particularly oligonucleotides (e.g., those identified in the drugscreening methods described above), for use in modulating the functionof nucleic acid molecules encoding cancer markers of the presentinvention (e.g., BRD2, eIF4G1, RPL22, RPL13A, HES1, hypothetical proteinXP_(—)373908, ubiquilin 1, nucleolar protein 3 (NOL3),alpha-2-glycoprotein 1 and heat shock 70 kDa protein 8 (HSPA70)),ultimately modulating the amount of cancer marker expressed. This isaccomplished by providing antisense compounds that specificallyhybridize with one or more nucleic acids encoding cancer markers of thepresent invention. The specific hybridization of an oligomeric compoundwith its target nucleic acid interferes with the normal function of thenucleic acid. This modulation of function of a target nucleic acid bycompounds that specifically hybridize to it is generally referred to as“antisense.” The functions of DNA to be interfered with includereplication and transcription. The functions of RNA to be interferedwith include all vital functions such as, for example, translocation ofthe RNA to the site of protein translation, translation of protein fromthe RNA, splicing of the RNA to yield one or more mRNA species, andcatalytic activity that may be engaged in or facilitated by the RNA. Theoverall effect of such interference with target nucleic acid function ismodulation of the expression of cancer markers of the present invention.In the context of the present invention, “modulation” means either anincrease (stimulation) or a decrease (inhibition) in the expression of agene. For example, expression may be inhibited to potentially preventtumor proliferation.

It is preferred to target specific nucleic acids for antisense.“Targeting” an antisense compound to a particular nucleic acid, in thecontext of the present invention, is a multistep process. The processusually begins with the identification of a nucleic acid sequence whosefunction is to be modulated. This may be, for example, a cellular gene(or mRNA transcribed from the gene) whose expression is associated witha particular disorder or disease state, or a nucleic acid molecule froman infectious agent. In the present invention, the target is a nucleicacid molecule encoding a cancer marker of the present invention. Thetargeting process also includes determination of a site or sites withinthis gene for the antisense interaction to occur such that the desiredeffect, e.g., detection or modulation of expression of the protein, willresult. Within the context of the present invention, a preferredintragenic site is the region encompassing the translation initiation ortermination codon of the open reading frame (ORF) of the gene. Since thetranslation initiation codon is typically 5′-AUG (in transcribed mRNAmolecules; 5′-ATG in the corresponding DNA molecule), the translationinitiation codon is also referred to as the “AUG codon,” the “startcodon” or the “AUG start codon”. A minority of genes have a translationinitiation codon having the RNA sequence 5′-GUG, 5′-UUG or 5′-CUG, and5′-AUA, 5′-ACG and 5′-CUG have been shown to function in vivo. Thus, theterms “translation initiation codon” and “start codon” can encompassmany codon sequences, even though the initiator amino acid in eachinstance is typically methionine (in eukaryotes) or formylmethionine (inprokaryotes). Eukaryotic and prokaryotic genes may have two or morealternative start codons, any one of which may be preferentiallyutilized for translation initiation in a particular cell type or tissue,or under a particular set of conditions. In the context of the presentinvention, “start codon” and “translation initiation codon” refer to thecodon or codons that are used in vivo to initiate translation of an mRNAmolecule transcribed from a gene encoding a tumor antigen of the presentinvention, regardless of the sequence(s) of such codons.

Translation termination codon (or “stop codon”) of a gene may have oneof three sequences (i.e., 5′-UAA, 5′-UAG and 5′-UGA; the correspondingDNA sequences are 5′-TAA, 5′-TAG and 5′-TGA, respectively). The terms“start codon region” and “translation initiation codon region” refer toa portion of such an mRNA or gene that encompasses from about 25 toabout 50 contiguous nucleotides in either direction (i.e., 5′ or 3′)from a translation initiation codon. Similarly, the terms “stop codonregion” and “translation termination codon region” refer to a portion ofsuch an mRNA or gene that encompasses from about 25 to about 50contiguous nucleotides in either direction (i.e., 5′ or 3′) from atranslation termination codon.

The open reading frame (ORF) or “coding region,” which refers to theregion between the translation initiation codon and the translationtermination codon, is also a region that may be targeted effectively.Other target regions include the 5′ untranslated region (5′ UTR),referring to the portion of an mRNA in the 5′ direction from thetranslation initiation codon, and thus including nucleotides between the5′ cap site and the translation initiation codon of an mRNA orcorresponding nucleotides on the gene, and the 3′ untranslated region(3′ UTR), referring to the portion of an mRNA in the 3′ direction fromthe translation termination codon, and thus including nucleotidesbetween the translation termination codon and 3′ end of an mRNA orcorresponding nucleotides on the gene. The 5′ cap of an mRNA comprisesan N7-methylated guanosine residue joined to the 5′-most residue of themRNA via a 5′-5′ triphosphate linkage. The 5′ cap region of an mRNA isconsidered to include the 5′ cap structure itself as well as the first50 nucleotides adjacent to the cap. The cap region may also be apreferred target region.

Although some eukaryotic mRNA transcripts are directly translated, manycontain one or more regions, known as “introns,” that are excised from atranscript before it is translated. The remaining (and thereforetranslated) regions are known as “exons” and are spliced together toform a continuous mRNA sequence. mRNA splice sites (i.e., intron-exonjunctions) may also be preferred target regions, and are particularlyuseful in situations where aberrant splicing is implicated in disease,or where an overproduction of a particular mRNA splice product isimplicated in disease. Aberrant fusion junctions due to rearrangementsor deletions are also preferred targets. It has also been found thatintrons can also be effective, and therefore preferred, target regionsfor antisense compounds targeted, for example, to DNA or pre-mRNA.

In some embodiments, target sites for antisense inhibition areidentified using commercially available software programs (e.g.,Biognostik, Gottingen, Germany; SysArris Software, Bangalore, India;Antisense Research Group, University of Liverpool, Liverpool, England;GeneTrove, Carlsbad, Calif.). In other embodiments, target sites forantisense inhibition are identified using the accessible site methoddescribed in U.S. Patent WO0198537A2, herein incorporated by reference.

Once one or more target sites have been identified, oligonucleotides arechosen that are sufficiently complementary to the target (i.e.,hybridize sufficiently well and with sufficient specificity) to give thedesired effect. For example, in preferred embodiments of the presentinvention, antisense oligonucleotides are targeted to or near the startcodon.

In the context of this invention, “hybridization,” with respect toantisense compositions and methods, means hydrogen bonding, which may beWatson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, betweencomplementary nucleoside or nucleotide bases. For example, adenine andthymine are complementary nucleobases that pair through the formation ofhydrogen bonds. It is understood that the sequence of an antisensecompound need not be 100% complementary to that of its target nucleicacid to be specifically hybridizable. An antisense compound isspecifically hybridizable when binding of the compound to the target DNAor RNA molecule interferes with the normal function of the target DNA orRNA to cause a loss of utility, and there is a sufficient degree ofcomplementarity to avoid non-specific binding of the antisense compoundto non-target sequences under conditions in which specific binding isdesired (i.e., under physiological conditions in the case of in vivoassays or therapeutic treatment, and in the case of in vitro assays,under conditions in which the assays are performed).

Antisense compounds are commonly used as research reagents anddiagnostics. For example, antisense oligonucleotides, which are able toinhibit gene expression with specificity, can be used to elucidate thefunction of particular genes. Antisense compounds are also used, forexample, to distinguish between functions of various members of abiological pathway.

The specificity and sensitivity of antisense is also applied fortherapeutic uses. For example, antisense oligonucleotides have beenemployed as therapeutic moieties in the treatment of disease states inanimals and man. Antisense oligonucleotides have been safely andeffectively administered to humans and numerous clinical trials arepresently underway. It is thus established that oligonucleotides areuseful therapeutic modalities that can be configured to be useful intreatment regimes for treatment of cells, tissues, and animals,especially humans.

While antisense oligonucleotides are a preferred form of antisensecompound, the present invention comprehends other oligomeric antisensecompounds, including but not limited to oligonucleotide mimetics such asare described below. The antisense compounds in accordance with thisinvention preferably comprise from about 8 to about 30 nucleobases(i.e., from about 8 to about 30 linked bases), although both longer andshorter sequences may find use with the present invention. Particularlypreferred antisense compounds are antisense oligonucleotides, even morepreferably those comprising from about 12 to about 25 nucleobases.

Specific examples of preferred antisense compounds useful with thepresent invention include oligonucleotides containing modified backbonesor non-natural internucleoside linkages. As defined in thisspecification, oligonucleotides having modified backbones include thosethat retain a phosphorus atom in the backbone and those that do not havea phosphorus atom in the backbone. For the purposes of thisspecification, modified oligonucleotides that do not have a phosphorusatom in their internucleoside backbone can also be considered to beoligonucleosides.

Preferred modified oligonucleotide backbones include, for example,phosphorothioates, chiral phosphorothioates, phosphorodithioates,phosphotriesters, aminoalkylphosphotriesters, methyl and other alkylphosphonates including 3′-alkylene phosphonates and chiral phosphonates,phosphinates, phosphoramidates including 3′-amino phosphoramidate andaminoalkylphosphoramidates, thionophosphoramidates,thionoalkylphosphonates, thionoalkylphosphotriesters, andboranophosphates having normal 3′-5′ linkages, 2′-5′ linked analogs ofthese, and those having inverted polarity wherein the adjacent pairs ofnucleoside units are linked 3′-5′ to 5′-3′ or 2′-5′ to 5′-2′. Varioussalts, mixed salts and free acid forms are also included.

Preferred modified oligonucleotide backbones that do not include aphosphorus atom therein have backbones that are formed by short chainalkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkylor cycloalkyl internucleoside linkages, or one or more short chainheteroatomic or heterocyclic internucleoside linkages. These includethose having morpholino linkages (formed in part from the sugar portionof a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfonebackbones; formacetyl and thioformacetyl backbones; methylene formacetyland thioformacetyl backbones; alkene containing backbones; sulfamatebackbones; methyleneimino and methylenehydrazino backbones; sulfonateand sulfonamide backbones; amide backbones; and others having mixed N,O, S and CH₂ component parts.

In other preferred oligonucleotide mimetics, both the sugar and theinternucleoside linkage (i.e., the backbone) of the nucleotide units arereplaced with novel groups. The base units are maintained forhybridization with an appropriate nucleic acid target compound. One sucholigomeric compound, an oligonucleotide mimetic that has been shown tohave excellent hybridization properties, is referred to as a peptidenucleic acid (PNA). In PNA compounds, the sugar-backbone of anoligonucleotide is replaced with an amide containing backbone, inparticular an aminoethylglycine backbone. The nucleobases are retainedand are bound directly or indirectly to aza nitrogen atoms of the amideportion of the backbone. Representative United States patents that teachthe preparation of PNA compounds include, but are not limited to, U.S.Pat. Nos. 5,539,082; 5,714,331; and 5,719,262, each of which is hereinincorporated by reference. Further teaching of PNA compounds can befound in Nielsen et al., Science 254:1497 (1991).

Most preferred embodiments of the invention are oligonucleotides withphosphorothioate backbones and oligonucleosides with heteroatombackbones, and in particular —CH₂, —NH—O—CH₂—, —CH₂—N(CH₃)—O—CH₂—[knownas a methylene (methylimino) or NMI backbone], —CH₂—O—N(CH₃)—CH₂—,—CH₂—N(CH₃)—N(CH₃)—CH₂—, and —O—N(CH₃)—CH₂—CH₂—[wherein the nativephosphodiester backbone is represented as —O—P—O—CH₂—] of the abovereferenced U.S. Pat. No. 5,489,677, and the amide backbones of the abovereferenced U.S. Pat. No. 5,602,240. Also preferred are oligonucleotideshaving morpholino backbone structures of the above-referenced U.S. Pat.No. 5,034,506.

Modified oligonucleotides may also contain one or more substituted sugarmoieties. Preferred oligonucleotides comprise one of the following atthe 2′ position: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S-or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynylmay be substituted or unsubstituted C₁ to C₁₀ alkyl or C₂ to C₁₀ alkenyland alkynyl. Particularly preferred are O[(CH₂)_(n)O]_(m)CH₃,O(CH₂)_(n)OCH₃, O(CH₂)_(n)NH₂, O(CH₂)_(n)CH₃, O(CH₂)_(n)ONH₂, andO(CH₂)_(n)ON[(CH₂)_(n)CH₃)]2, where n and m are from 1 to about 10.Other preferred oligonucleotides comprise one of the following at the 2′position: C₁ to C₁₀ lower alkyl, substituted lower alkyl, alkaryl,aralkyl, O-alkaryl or O-aralkyl, SH, SCH₃, OCN, Cl, Br, CN, CF₃, OCF₃,SOCH₃, SO₂CH₃, ONO₂, NO₂, N₃, NH₂, heterocycloalkyl, heterocycloalkaryl,aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleavinggroup, a reporter group, an intercalator, a group for improving thepharmacokinetic properties of an oligonucleotide, or a group forimproving the pharmaco dynamic properties of an oligonucleotide, andother substituents having similar properties. A preferred modificationincludes 2′-methoxyethoxy (2′-O—CH₂CH₂OCH₃, also known as2′-O-(2-methoxyethyl) or 2′-MOE) (Martin et al., Helv. Chim. Acta 78:486[1995]) i.e., an alkoxyalkoxy group. A further preferred modificationincludes 2′-dimethylaminooxyethoxy (i.e., a O(CH₂)₂ON(CH₃)₂ group), alsoknown as 2′-DMAOE, and 2′-dimethylaminoethoxyethoxy (also known in theart as 2′-O-dimethylaminoethoxyethyl or 2′-DMAEOE), i.e.,2′-O—CH₂—O—CH₂—N(CH₂)₂.

Other preferred modifications include 2′-methoxy(2′-O—CH₃),2′-aminopropoxy(2′-OCH₂CH₂CH₂NH₂) and 2′-fluoro (2′-F). Similarmodifications may also be made at other positions on theoligonucleotide, particularly the 3′ position of the sugar on the 3′terminal nucleotide or in 2′-5′ linked oligonucleotides and the 5′position of 5′ terminal nucleotide. Oligonucleotides may also have sugarmimetics such as cyclobutyl moieties in place of the pentofuranosylsugar.

Oligonucleotides may also include nucleobase (often referred to in theart simply as “base”) modifications or substitutions. As used herein,“unmodified” or “natural” nucleobases include the purine bases adenine(A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C)and uracil (U). Modified nucleobases include other synthetic and naturalnucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine,xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkylderivatives of adenine and guanine, 2-propyl and other alkyl derivativesof adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine,5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil,cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo,8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substitutedadenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyland other 5-substituted uracils and cytosines, 7-methylguanine and7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Furthernucleobases include those disclosed in U.S. Pat. No. 3,687,808. Certainof these nucleobases are particularly useful for increasing the bindingaffinity of the oligomeric compounds of the invention. These include5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and 0-6substituted purines, including 2-aminopropyladenine, 5-propynyluraciland 5-propynylcytosine. 5-methylcytosine substitutions have been shownto increase nucleic acid duplex stability by 0.6-1.2. degree ° C. andare presently preferred base substitutions, even more particularly whencombined with 2′-O-methoxyethyl sugar modifications.

Another modification of the oligonucleotides of the present inventioninvolves chemically linking to the oligonucleotide one or more moietiesor conjugates that enhance the activity, cellular distribution orcellular uptake of the oligonucleotide. Such moieties include but arenot limited to lipid moieties such as a cholesterol moiety, cholic acid,a thioether, (e.g., hexyl-S-tritylthiol), a thiocholesterol, analiphatic chain, (e.g., dodecandiol or undecyl residues), aphospholipid, (e.g., di-hexadecyl-rac-glycerol or triethylammonium1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate), a polyamine or apolyethylene glycol chain or adamantane acetic acid, a palmityl moiety,or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety.

One skilled in the relevant art knows well how to generateoligonucleotides containing the above-described modifications. Thepresent invention is not limited to the antisense oligonucleotidesdescribed above. Any suitable modification or substitution may beutilized.

It is not necessary for all positions in a given compound to beuniformly modified, and in fact more than one of the aforementionedmodifications may be incorporated in a single compound or even at asingle nucleoside within an oligonucleotide. The present invention alsoincludes antisense compounds that are chimeric compounds. “Chimeric”antisense compounds or “chimeras,” in the context of the presentinvention, are antisense compounds, particularly oligonucleotides, whichcontain two or more chemically distinct regions, each made up of atleast one monomer unit, i.e., a nucleotide in the case of anoligonucleotide compound. These oligonucleotides typically contain atleast one region wherein the oligonucleotide is modified so as to conferupon the oligonucleotide increased resistance to nuclease degradation,increased cellular uptake, and/or increased binding affinity for thetarget nucleic acid. An additional region of the oligonucleotide mayserve as a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNAhybrids. By way of example, RNaseH is a cellular endonuclease thatcleaves the RNA strand of an RNA:DNA duplex. Activation of RNase H,therefore, results in cleavage of the RNA target, thereby greatlyenhancing the efficiency of oligonucleotide inhibition of geneexpression. Consequently, comparable results can often be obtained withshorter oligonucleotides when chimeric oligonucleotides are used,compared to phosphorothioate deoxyoligonucleotides hybridizing to thesame target region. Cleavage of the RNA target can be routinely detectedby gel electrophoresis and, if necessary, associated nucleic acidhybridization techniques known in the art.

Chimeric antisense compounds of the present invention may be formed ascomposite structures of two or more oligonucleotides, modifiedoligonucleotides, oligonucleosides and/or oligonucleotide mimetics asdescribed above.

The present invention also includes pharmaceutical compositions andformulations that include the antisense compounds of the presentinvention as described below.

C. RNAi Therapies

In other embodiments, RNAi is used to regulate expression of tumorantigens or cancer markers of the present invention. RNAi represents anevolutionary conserved cellular defense for controlling the expressionof foreign genes in most eukaryotes, including humans. RNAi is triggeredby double-stranded RNA (dsRNA) and causes sequence-specific mRNAdegradation of single-stranded target RNAs homologous in response todsRNA. The mediators of mRNA degradation are small interfering RNAduplexes (siRNAs), which are normally produced from long dsRNA byenzymatic cleavage in the cell. siRNAs are generally approximatelytwenty-one nucleotides in length (e.g. 21-23 nucleotides in length), andhave a base-paired structure characterized by two nucleotide3′-overhangs. Following the introduction of a small RNA, or RNAi, intothe cell, it is believed the sequence is delivered to an enzyme complexcalled RISC (RNA-induced silencing complex). RISC recognizes the targetand cleaves it with an endonuclease. It is noted that if larger RNAsequences are delivered to a cell, RNase III enzyme (Dicer) convertslonger dsRNA into 21-23 nt ds siRNA fragments.

Chemically synthesized siRNAs have become powerful reagents forgenome-wide analysis of mammalian gene function in cultured somaticcells. Beyond their value for validation of gene function, siRNAs alsohold great potential as gene-specific therapeutic agents (Tuschl andBorkhardt, Molecular Intervent. 2002; 2(3):158-67, herein incorporatedby reference).

The transfection of siRNAs into animal cells results in the potent,long-lasting post-transcriptional silencing of specific genes (Caplen etal, Proc Natl Acad Sci U.S.A. 2001; 98: 9742-7; Elbashir et al., Nature.2001; 411:494-8; Elbashir et al., Genes Dev. 2001; 15: 188-200; andElbashir et al., EMBO J. 2001; 20: 6877-88, all of which are hereinincorporated by reference). Methods and compositions for performing RNAiwith siRNAs are described, for example, in U.S. Pat. No. 6,506,559,herein incorporated by reference.

siRNAs are extraordinarily effective at lowering the amounts of targetedRNA, and by extension proteins, frequently to undetectable levels. Thesilencing effect can last several months, and is extraordinarilyspecific, because one nucleotide mismatch between the target RNA and thecentral region of the siRNA is frequently sufficient to preventsilencing Brummelkamp et al, Science 2002; 296:550-3; and Holen et al,Nucleic Acids Res. 2002; 30:1757-66, both of which are hereinincorporated by reference.

C. Genetic Therapies

The present invention contemplates the use of any genetic manipulationfor use in modulating the expression of cancer markers (e.g., BRD2,eIF4G1, RPL22, RPL13A, HES1, hypothetical protein XP_(—)373908,ubiquilin 1, nucleolar protein 3 (NOL3), alpha-2-glycoprotein 1 and heatshock 70 kDa protein 8 (HSPA70)) of the present invention. Examples ofgenetic manipulation include, but are not limited to, gene knockout(e.g., removing the cancer marker gene from the chromosome using, forexample, recombination), expression of antisense constructs with orwithout inducible promoters, and the like. Delivery of nucleic acidconstruct to cells in vitro or in vivo may be conducted using anysuitable method. A suitable method is one that introduces the nucleicacid construct into the cell such that the desired event occurs (e.g.,expression of an antisense construct).

Introduction of molecules carrying genetic information into cells isachieved by any of various methods including, but not limited to,directed injection of naked DNA constructs, bombardment with goldparticles loaded with said constructs, and macromolecule mediated genetransfer using, for example, liposomes, biopolymers, and the like.Preferred methods use gene delivery vehicles derived from viruses,including, but not limited to, adenoviruses, retroviruses, vacciniaviruses, and adeno-associated viruses. Because of the higher efficiencyas compared to retroviruses, vectors derived from adenoviruses are thepreferred gene delivery vehicles for transferring nucleic acid moleculesinto host cells in vivo. Adenoviral vectors have been shown to providevery efficient in vivo gene transfer into a variety of solid tumors inanimal models and into human solid tumor xenografts in immune-deficientmice. Examples of adenoviral vectors and methods for gene transfer aredescribed in PCT publications WO 00/12738 and WO 00/09675 and U.S.patent application Nos. 6,033,908, 6,019,978, 6,001,557, 5,994,132,5,994,128, 5,994,106, 5,981,225, 5,885,808, 5,872,154, 5,830,730, and5,824,544, each of which is herein incorporated by reference in itsentirety.

Vectors may be administered to subject in a variety of ways. Forexample, in some embodiments of the present invention, vectors areadministered into tumors or tissue associated with tumors using directinjection. In other embodiments, administration is via the blood orlymphatic circulation (See e.g., PCT publication 99/02685 hereinincorporated by reference in its entirety). Exemplary dose levels ofadenoviral vector are preferably 10⁸ to 10¹¹ vector particles added tothe perfusate.

V. Pharmaceutical Compositions

In some embodiments, the present invention provides pharmaceuticalcompositions that may comprise all or portions of tumor antigen orcancer marker polynucleotide sequences, tumor antigen polypeptides,inhibitors or antagonists of tumor antigen bioactivity, includingantibodies, alone or in combination with at least one other agent, suchas a stabilizing compound, and may be administered in any sterile,biocompatible pharmaceutical carrier, including, but not limited to,saline, buffered saline, dextrose, and water. The pharmaceuticalcompositions find use as therapeutic agents and vaccines for thetreatment of cancer.

The methods of the present invention find use in treating cancers asdescribed in greater detail above. Antibodies can be administered to thepatient intravenously in a pharmaceutically acceptable carrier such asphysiological saline. Standard methods for intracellular delivery ofantibodies can be used (e.g., delivery via liposome). Such methods arewell known to those of ordinary skill in the art. The formulations ofthis invention are useful for parenteral administration, such asintravenous, subcutaneous, intramuscular, and intraperitoneal.

As is well known in the medical arts, dosages for any one patientdepends upon many factors, including the patient's size, body surfacearea, age, the particular compound to be administered, sex, time androute of administration, general health, and interaction with otherdrugs being concurrently administered.

Accordingly, in some embodiments of the present invention, compositions(e.g., antibodies and vaccines) can be administered to a patient alone,or in combination with other nucleotide sequences, drugs or hormones orin pharmaceutical compositions where it is mixed with excipient(s) orother pharmaceutically acceptable carriers. In one embodiment of thepresent invention, the pharmaceutically acceptable carrier ispharmaceutically inert. In another embodiment of the present invention,compositions may be administered alone to individuals suffering fromcancer.

Depending on the type of cancer being treated, these pharmaceuticalcompositions may be formulated and administered systemically or locally.Techniques for formulation and administration may be found in the latestedition of “Remington's Pharmaceutical Sciences” (Mack Publishing Co,Easton Pa.). Suitable routes may, for example, include oral ortransmucosal administration; as well as parenteral delivery, includingintramuscular, subcutaneous, intramedullary, intrathecal,intraventricular, intravenous, intraperitoneal, or intranasaladministration.

For injection, the pharmaceutical compositions of the invention may beformulated in aqueous solutions, preferably in physiologicallycompatible buffers such as Hanks' solution, Ringer's solution, orphysiologically buffered saline. For tissue or cellular administration,penetrants appropriate to the particular barrier to be permeated areused in the formulation. Such penetrants are generally known in the art.

In other embodiments, the pharmaceutical compositions of the presentinvention can be formulated using pharmaceutically acceptable carrierswell known in the art in dosages suitable for oral administration. Suchcarriers enable the pharmaceutical compositions to be formulated astablets, pills, capsules, liquids, gels, syrups, slurries, suspensionsand the like, for oral or nasal ingestion by a patient to be treated.

Pharmaceutical compositions suitable for use in the present inventioninclude compositions wherein the active ingredients are contained in aneffective amount to achieve the intended purpose. For example, aneffective amount of antibody or vaccine may be that amount thatdecreases the presence of cancerous cells (e.g., shrinks or eliminates atumor or reduces the number of circulating cancer cells). Determinationof effective amounts is well within the capability of those skilled inthe art, especially in light of the disclosure provided herein.

In addition to the active ingredients these pharmaceutical compositionsmay contain suitable pharmaceutically acceptable carriers comprisingexcipients and auxiliaries that facilitate processing of the activecompounds into preparations that can be used pharmaceutically. Thepreparations formulated for oral administration may be in the form oftablets, dragees, capsules, or solutions.

The pharmaceutical compositions of the present invention may bemanufactured in a manner that is itself known (e.g., by means ofconventional mixing, dissolving, granulating, dragee-making, levigating,emulsifying, encapsulating, entrapping or lyophilizing processes).

Pharmaceutical formulations for parenteral administration includeaqueous solutions of the active compounds in water-soluble form.Additionally, suspensions of the active compounds may be prepared asappropriate oily injection suspensions. Suitable lipophilic solvents orvehicles include fatty oils such as sesame oil, or synthetic fatty acidesters, such as ethyl oleate or triglycerides, or liposomes. Aqueousinjection suspensions may contain substances that increase the viscosityof the suspension, such as sodium carboxymethyl cellulose, sorbitol, ordextran. Optionally, the suspension may also contain suitablestabilizers or agents that increase the solubility of the compounds toallow for the preparation of highly concentrated solutions.

Pharmaceutical preparations for oral use can be obtained by combiningthe active compounds with solid excipient, optionally grinding aresulting mixture, and processing the mixture of granules, after addingsuitable auxiliaries, if desired, to obtain tablets or dragee cores.Suitable excipients are carbohydrate or protein fillers such as sugars,including lactose, sucrose, mannitol, or sorbitol; starch from corn,wheat, rice, potato, etc; cellulose such as methyl cellulose,hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; andgums including arabic and tragacanth; and proteins such as gelatin andcollagen. If desired, disintegrating or solubilizing agents may beadded, such as the cross-linked polyvinyl pyrrolidone, agar, alginicacid or a salt thereof such as sodium alginate.

Dragee cores are provided with suitable coatings such as concentratedsugar solutions, which may also contain gum arabic, talc,polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titaniumdioxide, lacquer solutions, and suitable organic solvents or solventmixtures. Dyestuffs or pigments may be added to the tablets or drageecoatings for product identification or to characterize the quantity ofactive compound, (i.e., dosage).

Pharmaceutical preparations that can be used orally include push-fitcapsules made of gelatin, as well as soft, sealed capsules made ofgelatin and a coating such as glycerol or sorbitol. The push-fitcapsules can contain the active ingredients mixed with filler or binderssuch as lactose or starches, lubricants such as talc or magnesiumstearate, and, optionally, stabilizers. In soft capsules, the activecompounds may be dissolved or suspended in suitable liquids, such asfatty oils, liquid paraffin, or liquid polyethylene glycol with orwithout stabilizers.

Compositions comprising a compound of the invention formulated in apharmaceutical acceptable carrier may be prepared, placed in anappropriate container, and labeled for treatment of an indicatedcondition. For antibodies to a tumor antigen of the present invention,conditions indicated on the label may include treatment of conditionsrelated to cancer.

The pharmaceutical composition may be provided as a salt and can beformed with many acids, including but not limited to hydrochloric,sulfuric, acetic, lactic, tartaric, malic, succinic, etc. Salts tend tobe more soluble in aqueous or other protonic solvents that are thecorresponding free base forms. In other cases, the preferred preparationmay be a lyophilized powder in 1 mM-50 mM histidine, 0.1%-2% sucrose,2%-7% mannitol at a pH range of 4.5 to 5.5 that is combined with bufferprior to use.

For any compound used in the method of the invention, thetherapeutically effective dose can be estimated initially from cellculture assays. Then, preferably, dosage can be formulated in animalmodels (particularly murine models) to achieve a desirable circulatingconcentration range that adjusts antibody levels.

A therapeutically effective dose refers to that amount of antibody thatameliorates symptoms of the disease state. Toxicity and therapeuticefficacy of such compounds can be determined by standard pharmaceuticalprocedures in cell cultures or experimental animals, e.g., fordetermining the LD₅₀ (the dose lethal to 50% of the population) and theED₅₀ (the dose therapeutically effective in 50% of the population). Thedose ratio between toxic and therapeutic effects is the therapeuticindex, and it can be expressed as the ratio LD₅₀/ED₅₀. Compounds thatexhibit large therapeutic indices are preferred. The data obtained fromthese cell culture assays and additional animal studies can be used informulating a range of dosage for human use. The dosage of suchcompounds lies preferably within a range of circulating concentrationsthat include the ED₅₀ with little or no toxicity. The dosage varieswithin this range depending upon the dosage form employed, sensitivityof the patient, and the route of administration.

The exact dosage is chosen by the individual physician in view of thepatient to be treated. Dosage and administration are adjusted to providesufficient levels of the active moiety or to maintain the desiredeffect. Additional factors which may be taken into account include theseverity of the disease state; age, weight, and gender of the patient;diet, time and frequency of administration, drug combination(s),reaction sensitivities, and tolerance/response to therapy. Long actingpharmaceutical compositions might be administered every 3 to 4 days,every week, or once every two weeks depending on half-life and clearancerate of the particular formulation.

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to atotal dose of about 1 g, depending upon the route of administration.Guidance as to particular dosages and methods of delivery is provided inthe literature (See, U.S. Pat. Nos. 4,657,760; 5,206,344; or 5,225,212,all of which are herein incorporated by reference).

In some embodiments, the pharmaceutical compositions of the presentinvention further include one or more agents useful in the treatment ofcancer. For example, in some embodiments, one or more antibodies orvaccines are combined with a chemotherapeutic agent. Chemotherapeuticagents are well known to those of skill in the art. Examples of suchchemotherapeutics include alkylating agents, antibiotics,antimetabolitic agents, plant-derived agents, and hormones. Among thesuitable alkylating agents are nitrogen mustards, such ascyclophosphamide, aziridines, alkyl alkone sulfonates, nitrosoureas,nonclassic alkylating agents, such as dacarbazine, and platinumcompounds, such as carboplatin and cisplatin. Among the suitableantibiotic agents are dactinomycin, bleomycin, mitomycin C, plicamycin,and the anthracyclines, such as doxorubicin (also known as adriamycin)and mitoxantrone. Among the suitable antimetabolic agents are antifols,such as methotrexate, purine analogues, pyrimidine analogues, such as5-fluorouracil (5-FU) and cytarabine, enzymes, such as theasparaginases, and synthetic agents, such as hydroxyurea. Among thesuitable plant-derived agents are vinca alkaloids, such as vincristineand vinblastine, taxanes, epipodophyllotoxins, such as etoposide, andcamptothecan. Among suitable hormones are steroids. Currently, thepreferred drug is adriamycin. However, other suitable chemotherapeuticagents, including additional agents within the groups of agentsidentified above, may be readily determined by one of skill in the artdepending upon the type of cancer being treated, the condition of thehuman or veterinary patient, and the like.

Suitable dosages for the selected chemotherapeutic agent are known tothose of skill in the art. One of skill in the art can readily adjustthe route of administration, the number of doses received, the timing ofthe doses, and the dosage amount, as needed. Such a dose, which may bereadily adjusted depending upon the particular drug or agent selected,may be administered by any suitable route, including but not limited to,those described above. Doses may be repeated as needed.

VI. Transgenic Animals Expressing Cancer Marker Genes or Knockouts

The present invention contemplates the generation of transgenic animalscomprising an exogenous cancer marker or tumor antigen (BRD2, eIF4G1,RPL22, RPL13A, HES1, hypothetical protein XP_(—)373908, ubiquilin 1,nucleolar protein 3 (NOL3), alpha-2-glycoprotein 1 and heat shock 70 kDaprotein 8 (HSPA70)) gene of the present invention or mutants andvariants thereof (e.g., truncations or single nucleotide polymorphisms).In other embodiments, the transgenic animals comprise a knock-out of acancer marker or tumor antigen gene. In preferred embodiments, thetransgenic animal displays an altered phenotype (e.g., increased ordecreased presence of markers) as compared to wild-type animals. Methodsfor analyzing the presence or absence of such phenotypes include but arenot limited to, those disclosed herein. In some preferred embodiments,the transgenic animals further display an increased or decreased growthof tumors or evidence of cancer.

The transgenic animals of the present invention find use in drug (e.g.,cancer therapy) screens. In some embodiments, test compounds (e.g., adrug that is suspected of being useful to treat cancer) and controlcompounds (e.g., a placebo) are administered to the transgenic animalsand the control animals and the effects evaluated.

The transgenic animals can be generated via a variety of methods. Insome embodiments, embryonal cells at various developmental stages areused to introduce transgenes for the production of transgenic animals.Different methods are used depending on the stage of development of theembryonal cell. The zygote is the best target for micro-injection. Inthe mouse, the male pronucleus reaches the size of approximately 20micrometers in diameter that allows reproducible injection of 1-2picoliters (p1) of DNA solution. The use of zygotes as a target for genetransfer has a major advantage in that in most cases the injected DNAwill be incorporated into the host genome before the first cleavage(Brinster et al., Proc. Natl. Acad. Sci. USA 82:4438-4442 [1985]). As aconsequence, all cells of the transgenic non-human animal will carry theincorporated transgene. This will in general also be reflected in theefficient transmission of the transgene to offspring of the foundersince 50% of the germ cells will harbor the transgene. U.S. Pat. No.4,873,191 describes a method for the micro-injection of zygotes; thedisclosure of this patent is incorporated herein in its entirety.

In other embodiments, retroviral infection is used to introducetransgenes into a non-human animal. In some embodiments, the retroviralvector is utilized to transfect oocytes by injecting the retroviralvector into the perivitelline space of the oocyte (U.S. Pat. No.6,080,912, incorporated herein by reference). In other embodiments, thedeveloping non-human embryo can be cultured in vitro to the blastocyststage. During this time, the blastomeres can be targets for retroviralinfection (Janenich, Proc. Natl. Acad. Sci. USA 73:1260 [1976]).Efficient infection of the blastomeres is obtained by enzymatictreatment to remove the zona pellucida (Hogan et al., in Manipulatingthe Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold SpringHarbor, N.Y. [1986]). The viral vector system used to introduce thetransgene is typically a replication-defective retrovirus carrying thetransgene (Jahner et al., Proc. Natl. Acad. Sci. USA 82:6927 [1985]).Transfection is easily and efficiently obtained by culturing theblastomeres on a monolayer of virus-producing cells (Stewart, et al.,EMBO J., 6:383 [1987]). Alternatively, infection can be performed at alater stage. Virus or virus-producing cells can be injected into theblastocoele (Jahner et al., Nature 298:623 [1982]). Most of the founderswill be mosaic for the transgene since incorporation occurs only in asubset of cells that form the transgenic animal. Further, the foundermay contain various retroviral insertions of the transgene at differentpositions in the genome that generally will segregate in the offspring.In addition, it is also possible to introduce transgenes into thegermline, albeit with low efficiency, by intrauterine retroviralinfection of the midgestation embryo (Jahner et al., supra [1982]).Additional means of using retroviruses or retroviral vectors to createtransgenic animals known to the art involve the micro-injection ofretroviral particles or mitomycin C-treated cells producing retrovirusinto the perivitelline space of fertilized eggs or early embryos (PCTInternational Application WO 90/08832 [1990], and Haskell and Bowen,Mol. Reprod. Dev., 40:386 [1995]).

In other embodiments, the transgene is introduced into embryonic stemcells and the transfected stem cells are utilized to form an embryo. EScells are obtained by culturing pre-implantation embryos in vitro underappropriate conditions (Evans et al., Nature 292:154 [1981]; Bradley etal., Nature 309:255 [1984]; Gossler et al., Proc. Acad. Sci. USA 83:9065[1986]; and Robertson et al., Nature 322:445 [1986]). Transgenes can beefficiently introduced into the ES cells by DNA transfection by avariety of methods known to the art including calcium phosphateco-precipitation, protoplast or spheroplast fusion, lipofection andDEAE-dextran-mediated transfection. Transgenes may also be introducedinto ES cells by retrovirus-mediated transduction or by micro-injection.Such transfected ES cells can thereafter colonize an embryo followingtheir introduction into the blastocoel of a blastocyst-stage embryo andcontribute to the germ line of the resulting chimeric animal (forreview, See, Jaenisch, Science 240:1468 [1988]). Prior to theintroduction of transfected ES cells into the blastocoel, thetransfected ES cells may be subjected to various selection protocols toenrich for ES cells which have integrated the transgene assuming thatthe transgene provides a means for such selection. Alternatively, thepolymerase chain reaction may be used to screen for ES cells that haveintegrated the transgene. This technique obviates the need for growth ofthe transfected ES cells under appropriate selective conditions prior totransfer into the blastocoel.

In still other embodiments, homologous recombination is utilized toknock-out gene function or create deletion mutants (e.g., truncationmutants). Methods for homologous recombination are described in U.S.Pat. No. 5,614,396, incorporated herein by reference.

EXPERIMENTAL

The following examples are provided in order to demonstrate and furtherillustrate certain preferred embodiments and aspects of the presentinvention and are not to be construed as limiting the scope thereof.

In the experimental disclosure which follows, the followingabbreviations apply: N (normal); M (molar); mM (millimolar); μM(micromolar); mol (moles); mmol (millimoles); μmol (micromoles); nmol(nanomoles); pmol (picomoles); g (grams); mg (milligrams); μg(micrograms); ng (nanograms); 1 or L (liters); ml (milliliters); μl(microliters); cm (centimeters); mm (millimeters); μm (micrometers); nm(nanometers); and ° C. (degrees Centigrade).

Example 1

Phage Array Profiling of Prostate Cancer

This Example describes a phage array profiling method of the presentinvention as applied to prostate cancer.

A. Methods

Patient Population and Samples. At the time of diagnosis and prior toradical prostatectomy, sera from biopsy-proven clinically localizedprostate cancer participants were collected by the University ofMichigan Specialized Research Program in Prostate Cancer (SPORE)tissue/serum bank between January 1995 to January of 2003. The averageage of all prostate cancer patients was 59.6 (range 41-74). Forpost-prostatectomy prostate cancer patients, the average age and PSAvalue were 58.1 and 0.169 ng/ml respectively. Sera from lungadenocarcinoma patients (average age 53.9) without any known history ofprostate cancer were used. As controls, serum samples from 85age-matched males (average age 62.5, range 50-80) with no known historyof cancer were used for the study. All sera were stored in aliquots at−20° C. until use.Construction of T7 phage display prostate cancer cDNA libraries. TotalRNA was isolated separately from six prostate cancer tissue samplesaccording to the standard Trizol protocol (Dhanasekaran et al., Nature412, 822-826. (2001)). The integrity of each RNA preparation wasassessed by confirming that the A₂₆₀/A₂₈₀ ratio was greater than 1.8 andgel electrophoresis. Equal amounts of total RNA from six tissues werecombined to make a pool. Poly(A) RNA was purified from the total RNApool following Straight A's mRNA Isolation System protocol (Novagen). Atotal of 8.7 μg of mRNA was eluted and its integrity was judged by gelelectrophoresis.

OrientExpression cDNA Synthesis and Cloning System (Novagen) was usedfor the construction of the T7 phage prostate cancer cDNA libraries. Inorder to ensure the representation of both N-terminal and C-terminalamino acid sequences and eliminate the 3′ bias inherent fromoligo(dT)-primed strands, equal amounts of mRNA from each was used toconstruct two cDNA libraries using directional oligo(dT) primers andrandom primers in parallel.

After vector ligation and T7 packaging, two cDNA phage display librarieswere constructed and the library titers were determined by plaque assaywith 4.2×10⁶ pfu for the oligo(dT) primer library and 2.2×10⁶ pfu forthe random primer library, respectively. Phage particles from twolibraries were combined to make phage library pool. After amplification,glycerol was added and the libraries were stored at −80° C.

Amplification of Libraries. Five milliliters of LB with carbenicillinwas inoculated at 37° C. overnight with a single colony of BLT5615 froma freshly streaked plate. Overnight culture was added to 100 ml of LBwith carbenicillin and grew to an OD₆₀₀ of 0.5. One mM of IPTG was addedand the cells were allowed to grow for further 30 min. An appropriatevolume of culture was infected with phage library at multiplicity ofinfection (MOI) of 0.001-0.01 (i.e. 100-1000 cells for each pfu). Theinfected bacteria were incubated with shaking at 37° C. for 1-2 hr untillysis was observed. The phage lysate was then clarified by spinning at8000×g for 10 min. The supernatant is collected and stored at −80° C.Biopanning for Phage-Epitope Clones Specific for Prostate Cancer. Toenrich for phageepitopes that bind to IgGs specifically associated withprostate cancer, a positive and negative selection strategy wasperformed. First, a pre-clearing step was used to remove non-specificepitope-clones by pre-adsorbing the phage libraries onto purified IgGpool from 10 normal sera. Next, the pre-cleared phage libraries wereselected onto the pool of IgGs purified from the sera of 19 localizedprostate cancer patients. Protein A/G agarose beads (Pierce) were thenused to purify IgGs from the sera of prostate cancer patients. Briefly,10 μl protein-A/G agarose beads were placed into 1.5 ml eppendorf tubesand washed two times with 1×PBS. Washed beads were blocked with 1% BSAat 4° C. for 1 hr. The beads were then incubated at 4° C. with 15 μl ofindividual serum from control or prostate cancer patients at 1:50dilution in 1% BSA. After incubation overnight, the beads were washedwith 1×PBS by centrifuging at 1000 g for 2 min. After three washes, 10μl of 1×PBS was added to each tube, and 10 tubes of protein A/G-IgGcomplex from 10 control sera and 19 tubes of prostate cancer sera werecombined to make IgG pools of control and prostate cancer respectively.These control and prostate cancer IgG pools associated with protein A/Gbeads were stored at 4° C. as stocks for subsequent biopanning.

Twenty microliters of control IgG pool was incubated with 30 μlamplified phage library pool diluted at 1:40 with 10% BSA at 4° C. After2 hrs, the mixture was centrifuged at 1000 g for 2 min. The beads withnon-specifically bound phage particles were discarded, and thesupernatant was collected. Next, the supernatant was incubated with 30μl of the prostate cancer IgG pool at 4° C. overnight. The mixture wascentrifuged at 1000 g for 2 min and the supernatant was discarded. Toelute the bound phage, 100 μl of 1% SDS was added and incubated at roomtemperature for 10 min to break up the antibody-antigen reaction withoutdisrupting T7 phage particles. The bound phages were removed from thebeads by centrifuging at 5500 g for 8 min. Eluted phages weretransferred to 10 ml culture of BLT5615 cells for amplification. Fivecycles of affinity selections and biopanning were carried out forenrichment of prostate cancer-specific epitope phages.

Construction of the Phage-Epitope Microarrays. The phage library (˜10¹⁰pfu) from the fifth cycle of biopanning was diluted at 1:10⁸ and allowedto grow on LB agar plates with carbenicillin. A total number of 2300random phage colonies were picked and amplified in 96-well plates. Thephage lysates were spotted onto on FAST slides (Schleicher & Schuell) tomake high density phage epitope microarrays using a GMS 417 printer(Affymetrix). T7 phage without any cDNA insert and anti-human IgG at1:1000 dilution were spotted in triplicate as negative and positivecontrols, respectively. The arrays were dried overnight at roomtemperature. Before processing, the arrays were rinsed briefly in a 4%nonfat milk/PBS with 0.1% tween-20 to remove unbound phage, and thentransferred immediately to 4% nonfat milk/PBS as a blocking solution for1 hr at room temperature. Without allowing to dry, 2 ml of PBScontaining human serum and T7-tag antibody (Novagen) at a dilution of1:500 and 1:5000 respectively was applied to the surface of the slidesin a screw-top slide hybridization tube. To test the specificity of theimmune response, reactive serum was first quenched of non-specificactivity by pre-adsorbing with 50 fold higher amount (v/v) of bacteriallysate (OD600 of 0.5) and then used for incubation as described below.The arrays were incubated with sera from prostate cancer or controlindividuals for 1 hour at room temperature and then washed 5 times inPBS/0.1% Tween-20 solution for 5 min each. All washes were performed atroom temperature.

After washing, the arrays were incubated with 2 ml of PBS containingCy3-labeled goat antimouse antibody and Cy5-labeled goat anti-humanantibody (Jackson ImmunoResearch) at a dilution of 1:5,000 for both for1 hr in the dark. Five washes were performed using PBS/0.1% Tween-20solution with 5 mins each. The arrays were dried by centrifuging at 500g for 5 min and scanned.

Scanning and Primary Analysis of Phage-Epitope Microarrays. All slideswere scanned using 532 nm and 635 nm lasers (Axon Laboratories). Afterscanning, the array images were quantified using GenePix software (AxonLaboratories). According to the experimental design, the median ofCy5/Cy3 was utilized so as to control the small variations in the amountof phage epitope being spotted. Ratio of Cy5/Cy3 for each spot wassubtracted by median of Cy5/Cy3 of the negative T7 empty spots with theobservation that the signal for the T7 empty phage on each chip highlycorrelated with the signal intensity for whole array. A Z-transformationwas applied to clones so that the mean of each clone was zero acrossarrays and the standard deviation was 1.

Normalized data was subjected to two-way clustering analysis with use ofCluster and TreeView (Eisen et al., Proc Natl Acad Sci USA 95,14863-14868 (1998)). To filter the data, the criteria of at least 1observation with absolute values greater than 1.2 was applied and 186clones were selected. An unsupervised hierarchical clustering analysiswas performed with correlation (uncentered) similarity matrix andaverage linkage clustering.

Supervised Analysis of Humoral Immune Response Profiles. In order toefficiently screen hundreds of sera on phage epitope clones, a focusedprotein microarray comprised of 180 phage clones selected from theprimary analysis of high-density epitope microarrays described above wasutilized. This focused microarray included four T7 empty phages asnegative controls. By employing this small microarray platform, 129 seraincluded 59 sera from prostate cancer patients obtained prior toprostatectomy and 70 control sera from age-matched males were screenedas mentioned above.

The entire dataset from 129 samples was used to build a class predictionmodel by a leave one out cross-validation (LOOCV) strategy in geneticalgorithm/K-nearest neighbors (GA/KNN) (k=3 in this study) method (Li etal., 4, 727-739 (2001)). The raw phage-epitope microarray data wasnormalized as described for the high-density epitope microarrays. Thenormalized array data was then applied to GA for selection of featureepitopes and assessment of the relative predictive importance of theepitope by ranking them based on their frequency of occurrence in GAsolutions. Different numbers of the top-most epitopes were used to builda different KNN prediction model.

Prediction accuracy and error were calculated using LOOCV to evaluatethe performance of different KNN model. Finally, a top-ranked 22 cloneswere selected based on their best performance on specificity andsensitivity. Prediction sensitivity and specificity were computed basedon the number of misclassified samples in the cancer and control groups.

Class Prediction on Independent Data. A weighted voting scheme wasadopted to predict “test samples”, as described previously (Golub etal., Science 286, 531-537 (1999)). Briefly, each epitope in the featureset casts a weighted vote for a class 0 or 1: Vx=Tx (e_(x)−b_(x)) wheree_(x) is expression value of epitope x, T_(x) is the t-statistic forcomparing the two class means of epitope x in the training set, andb_(x) is (μ_(class0)+μ_(class1))/2. The final vote for class 0 or 1 issign (Σ_(x) V_(x)) and the prediction strength (PS) or confidence in theprediction of the winning class is(V_(win)−V_(lose))/(V_(win)+V_(lose)), where V_(i) is the votes forclass i.Statistical Analysis. Principal Components Analysis (PCA) (Crescenzi andGiuliani, FEBS Lett 507, 114-118 (2001)) was applied on the epitomicprofiles of the 22 phage clones. The first five components contained 90%of the variation in the data set and were subsequently used ascovariates in the logistic regression fitting cancer versus normal asbinary diagnostic outcome. Fitted probabilities were obtained and usedto generate the ROC curve to assess the prediction accuracy of theepitomic profile. All statistical analysis was performed with SPSS 11.1(SPSS Inc). The mean values for phage epitope humoral response werepresented as mean plots with the error bars signifying a 95% confidenceinterval of the mean. P values less than 0.05 were consideredstatistically significant.Sequence Analysis of Humoral Response Candidates. The top 22 phageepitope clones were amplified by PCR using T7 capsid forward and reverseprimers (Novagen). Briefly, 2 μl of fresh phage lysate with titer of˜10¹⁰ pfu was incubated with 100 μl of 10 mM EDTA, pH 8.0 at 60° C. for10 min. After centrifuging at 14,000×g for 3 min, 2 μl of denaturedphage was used for PCR in 100 μl volume of reaction under standardcondition. PCR products were confirmed on 1% agarose gel containingethidium bromide. After purifying with MultiScreen-FB filter plate(Millipore) following manufacturer's protocol, PCR products weresequenced using T7 capsid forward primer to determine the cDNA inserts.DNA sequence and potential protein sequence were aligned using NCBIBLAST.Development of an ELISA to Validate Humoral Response Candidates. ELISAswere developed for the phage epitopes to confirm their immunoreactivitywith different patient serum. Ninety-six well MAX-SORB microtiter plates(NUNC) were coated with 100 μl of diluted T7-tag antibody (Novagen)using 1× PBS at 1:1000 overnight at 4° C. on an orbital shaker. All theadditions were in 100 μl volumes unless otherwise mentioned. Dilutionsof serum and secondary detection reagents were carried out in 1:5 HPEbuffer (R&D systems). After washing 5 times with PBS/Tween-20 usingEL404 microplate autowasher (Bio-Tek), the plates were blocked firstwith 200 μl of 2% BSA/PBS for 2 hrs followed by 200 μl of superblock(Pierce) for 2 mins, both at room temperature. Phages and the T7 emptyphage as negative control were separately diluted at 1:25 to a finaltitration of ˜10⁹ pfu. After washing as above, the plate was incubatedwith 100 μl of diluted phages for 2 hrs at RT. Serially diluted (1:500,1:1000 and 1:2000) serum samples were added to each well, and incubatedfor 1 hr at RT. After washing, the plates were then incubated with1:10000 diluted HRP-conjugated anti-human IgG for 1 hr at RT. The plateswere then developed using 100 μl TMB substrate system (Sigma) for 30 minafter final washing. The reaction was stopped using 50 μl of 1.5 M H₂SO₄and read at 450 nm using ELx 800 universal microplate reader (Bio-Tek).Meta-Analysis of Gene Expression of Humoral Response Candidates. Thegene expression level of four genes, namely BRD2, eIF4G1, RPL13A andRPL22, were studied using ONCOMINE. Briefly, each gene was searched onthe database, and the results were filtered by selecting prostatecancer. The data from study class of benign prostate, prostate cancerand/or metastatic prostate cancer with p<0.05 were used to plot the boxplots with SPSS 11.1. P values for each group were calculated usingstudent t-test.Immunoblot Analysis. Tissues were homogenized in NP-40 lysis buffercontaining 50 mmol/L Tris-HCl, pH 7.4, 1% Nonidet P-40 (Sigma) andcomplete protease inhibitor cocktail (Roche). Fifteen μg of proteinextracts were mixed with SDS sample buffer and electrophoresed onto a4-15% linear gradient SDS-polyacrylamide gel under reducing conditions.The separated proteins were transferred onto polyvinyl difluoridemembranes (Amersham). The membranes were then incubated for 1 hour inblocking buffer (Tris-buffered saline with 0.1% Tween (TBS-T) and 5%nonfat dry milk). Membranes were incubated with purified eIF4G1 rabbitpolyclonal at 1:4000 dilution (Bethyl), RPL22 mouse monoclonal (BDbiosciences) at 1:400 dilution, BRD2 rabbit polyclonal (Abgent) dilutedat 1:400 and RPL13a rabbit polyclonal (kind gift of Dr. Paul Fox) usedat 1:4000 dilution and incubated overnight at 4° C. After washing threetimes with TBS-T buffer, the membrane was incubated with horseradishperoxidase-linked donkey anti-rabbit IgG or rabbit anti-mouse IgG HRPconjugate (Amersham) at 1:5000 for 1 hour at room temperature.

After washing the blots with TBS-T and TBS, the signals were visualizedwith the ECL detection system (Amersham) and autoradiography. To monitorequal loading, the membranes were incubated with anti-human GAPDHantibody (Abcam) at 1:25,000 dilution for two hours and the signals werevisualized.

Tissue microarray (TMA) and Immunohistochemistry. In order to determinethe expression of eIF4G1 protein in situ across a wide range of prostatetissues, a prostate cancer progression TMA composed of benign prostatetissue, localized prostate cancers and metastatic prostate cancer wasemployed. Antigen retrieval was carried out by heating the slides incitrate buffer pH 6.0 in a microwave oven for 15 minutes. Rabbitanti-eIF4G1 (Bethyl) antibodies were applied (1:100 dilution) andincubated for 1 hour at room temperature. Secondary anti-mouseantibodies avidin-conjugated were applied before washing. Enzymaticreaction was completed using a streptavidin biotin detection kit (Dako).Immunofluorescence and confocal microscopy. The prostate cancer tissuesection slides were soaked in xylene to remove paraffin. Antigen wasretrieved by heating the slides in citrate buffer pH6.0 for 15 minutesin a pressure cooker. The slides were then blocked in PBS-T with 5%normal donkey serum for 1 hour. A mixture of rabbit anti-eIF4G1 (Bethyl)antibody and mouse anti-Ecadherin (BD biosciences) antibody was added tothe slides at 1:40 and 1:250 dilutions respectively and incubated for 1hour at room temperature. Slides were then incubated with secondaryantibodies (anti-mouse Alexa 488 and anti-rabbit Alexa 555 at 1:1000dilution) were incubated for 1 hour. After washing the slides with PBS-Tand PBS, the slides were mounted using vectashield mounting mediumcontaining DAPI. Confocal images were taken with Ziess LSM510META (Carl Zeiss) imaging system using ultraviolet, Argon and HeliumNeon 1 light source. The triple color images were exported as TIFFimages and color balanced.B. Results

An overview of the method used in the present invention to identifyepitomic biomarkers of prostate cancer is described in FIG. 1. Todevelop a T7 phage display library for prostate cancer, RNA was isolatedfrom prostate cancer tissues derived from six patients with clinicallylocalized disease (three patients with Gleason grade 6 and threepatients with Gleason grade 7 prostate cancer). To generate a wide rangeof epitopes (both representing C-terminal and N terminal epitopes),parallel libraries were constructed using oligo(dT) and random primers.

Once packaged into the T7 phage system, epitopes from the library wereexpressed as a fusion protein with the capsid 10B protein on the surfaceof the phage. This serves as “bait” to capture potential autoantibodiesfound in serum. To enrich for epitopes that specifically generate ahumoral response in prostate cancer patients, the phage-epitopelibraries were subjected to five rounds of biopanning (FIG. 1). In orderto remove non-specific immunoreactivity, the phage epitope particleswere pre-adsorbed to a pool of immunoglobulins (IgG) isolated from tencontrol individuals. The “flow-thru” or nonbonding supernatant was thenenriched for prostate cancer-specific epitopes by incubating with IgGsfrom a pool of 19 patients with clinically localized prostate cancer(see FIGS. 4, 5, and 6 for clinical and pathological information forpatients). Protein A/G beads were used to isolate phage-epitopeparticles that specifically bound antibodies from prostate cancerpatients. The bound phages were eluted and amplified in bacteria, thuscompleting one round of biopanning (FIG. 1). After five rounds ofbiopanning, it is expected that the pool will be enriched for epitopesthat specifically elicit a humoral immune response in prostate cancerpatients. Approximately 2300 (2.3K) phage-epitope clones were selectedrandomly from the biopanned material in order to generate proteinmicroarrays. Once in a microarray format, these enriched phage epitopeclones are used to interrogate serum samples for humoral immune responsemarkers.

Using this 2.3K phage-epitope microarray, sera from prostate cancerpatients and controls was evaluated. A two-color system was used inwhich a green fluorescent dye (Cy3) was used to measure levels of thecapsid 10B fusion protein as a control for protein spotting, and a redfluorescent (Cy5) was used to measure levels of bound IgG (FIG. 1).Therefore, increased Cy5/Cy3 ratios represented varying levels of immunereactivity. As an initial discovery approach, 31 serum samplesconsisting of 20 sera from prostate cancer patients and 11 controls wereevaluated. Most of the sera from prostate cancer patients exhibitedantibody repertoires that reacted with phage-epitope clones on themicroarrays while most of the controls did not. After normalization, thedata was filtered for elements that have a Cy5/Cy3 ratio with anabsolute value greater than 1.2 in at least one of the serum samples.This resulted in 186 phage-epitope clones, which were used forsubsequent analyses. Using an unsupervised learning method, Cy5/Cy3values from these immunoreactive clones were hierarchically clustered.The sera from prostate cancer patients and those from controlssegregated into two predominant clusters. Samples in the clustercontaining primarily sera from prostate cancer patients, exhibited arobust humoral response to specific phage epitope clones (represented byintensities of yellow color). In this set of 31 sera there was onemis-classified sample from both the prostate cancer cohort as well asthe control group. This resulted in a sensitivity and specificity of 95%and 91%, respectively.

To expand the population of sera tested, a focused phage-epitopemicroarray consisting of the 180 of clones used in the unsupervisedanalysis (above) as well as additional control elements (i.e., T7 emptyphage) was developed. Using these focused protein microarrays, 129 serumsamples including 59 patients with biopsy-confirmed prostate cancer and70 controls were evaluated. Unsupervised analysis using the total 176epitope clones (excluding four negative clones) revealed 80% specificityand 83% sensitivity for 129 serum samples (see FIG. 7). To increase theclassification accuracy, a class prediction model was developed byemploying a non-parametric pattern recognition approach, GeneticAlgorithm (GA) combined with k-Nearest Neighbor (KNN), to discriminatedifferent serum samples. The predictive importance of each epitope forsample classification was evaluated and the epitopes were then rankedwith the top-most epitope assigned a rank of 1. Eleven different KNNclass prediction models were constructed using different numbers of thetop-most epitopes (10, 20-26, 30, 50, and 100 features) to evaluatetheir predictive performances by leave-one-out-cross-validation. Theprediction accuracy improved as more epitopes were involved in themodels, whereas too many epitopes introduced excess error in the modelthus decreasing the prediction accuracy. The 22 phage epitope clonesyielded the best performance in classifying the serum samples with 97%specificity (2 out of 70 controls misclassified) and 88% sensitivity (7out of 59 prostate cancer patients mis-classified). Thus, in asubstantially larger cohort of sera, it was possible to predict prostatecancer status based on the humoral response to 22 phage epitopes.

The receiver operator characteristics (ROC) of a multiplex panel ofhumoral response markers was next evaluated to assess predictionaccuracy. In order to develop an ROC curve, the 22 predictive phageepitope biomarkers were considered as covariates and the dimension ofthe dataset from humoral immune response was reduced by principalcomponents analysis (PCA). The first five components accounting for 90%of the variation were applied to logistic regression to predict prostatecancer versus control. The fitted probabilities from the logistic model(p<0.001 for the overall model) were used as threshold points tocalculate sensitivities and specificities (FIG. 2A). The area under thecurve equaled 0.95.

The 22 top discriminating clones identified by supervised analysis weresequenced. Six out of the 22 clones were found to be in-frame and inknown expressed sequences. These Six included Bromodomain ContainingProtein 2 (BRD2), Eukaryotic Translation Initiation Factor 4 Gamma 1(eIF4G1), Ribosomal Protein L22 (RPL22), Ribosomal Protein L13A(RPL13A), HES 1 (hairy and enhancer of split 1, homolog of Drosophila),and hypothetical protein XP_(—)373908. None of these proteins have beenassociated with prostate cancer previously as either an over-expressedprotein or as a humoral response target. Except hypothetical proteinXP_(—)373908, four of the in-frame phage-epitope clones wereintracellular proteins involved in regulating transcription ortranslation in rapidly growing cells. BRD2, also known as RING3, is anuclear transcription factor kinase known to be up-regulated in humanleukemias (Denis and Green, Genes Dev 10, 261-271 (1996); Denis et al.,Cell Growth Differ 11, 417-424 (2000)). BRD2 has been shown tospecifically interact with acetylated lysine 12 on histone H4 (Kanno etal., Mol Cell 13, 33-43 (2004)). Initiation factors of the eIF4 groupare important in the recognition of the 5′ cap region of messenger RNAs(mRNA) as well as unwinding of mRNA structure (Gingras et al., Genes Dev15, 807-826 (2001)). Among them, eIF4G1 plays a central role in theassembly of the preinitiation complex (Morino et al., Mol Cell Biol 20,468-477 (2000)). eIF4G1 has been shown to be overexpressed in head andneck squamous cell carcinoma (Cromer et al., Oncogene (2003)) andsquamous lung carcinoma patients (Bauer, C. et al. Int J Cancer 98,181-185 (2002); Bauer et al., Cancer 92, 822-829 (2001)) and produces ahumoral immune response (Brass et al., Hum Mol Genet 6, 33-39 (1997)).Overexpression of eIF4G1 has been shown to transform NIH3T3 cells(Fukuchi-Shimogori et al., Cancer Res 57, 5041-5044 (1997)). RPL22 andRPL13A are cytoplasmic ribosomal proteins that are the components of the60S subunit (Mazumder et al., Cell 115, 187-198 (2003)). RPL22 has beenshown to be overexpressed in lung cancer (Miura et al., Cancer Res 62,3244-3250 (2002); Racz et al., Eur J Cancer 35, 641-646 (1999)). RPL13awas identified as a candidate interferon-Gamma Activated Inhibitor ofTranslation (GAIT) and thus mediates transcript-specific translationalcontrol (Mazumder et al., supra). HES 1 is basic helix-loop-helixtranscription factor of the achaete-scute family. Human achaete-scuthomolog 1 (hASH 1) is highly expressed in neuroendocrine cancers such asmedullary thyroid cancer and small cell lung cancer. HES 1 genes encodehelix-loop-helix transcription repressors with structural homology tothe Drosophila hairy and Enhancer-to-split. HES1 protein is detected atabundant levels in most non-neuroendocrine human lung cancer cell lines.

The remaining 17 prostate cancer specific phage epitope clones wereeither in un-translated regions of expressed genes or out of frame inthe coding sequence of known genes (see FIGS. 11 and 12)). These cloneslikely represent “mimotopes” or epitopes that are structurally similarto expressed proteins but unrelated or weakly related at the proteinsequence level. Three of the remaining 17 discriminating clonesrepresented an epitope encoded by overlapping sequence from the 5′un-translated region (UTR) of the BMI1 gene (5′-UTR_BMI1), which is aPolycomb Group (PcG) protein implicated in various cellular processesincluding self-renewal (Park et al., Nature 423, 302-305 (2003);Molofsky et al., Nature 425, 962-967 (2003)). PcG proteins function asmulti-component complexes. Protein BLAST analysis of the peptidesequence shared by the three phage clones representing the 5′-UTR_BMI1identified significant homology (E value=5×10⁻⁴) to a glycine-richstretch of the androgen receptor (FIG. 12). Androgens are known to playan important role in prostate cancer progression (Singh and Figg, CancerBiol Ther 3 (2004); Taplin et al., J Cell Biochem 91, 483-190 (2004)).This was the only phage epitope clone picked up by the methods of thepresent invention that was represented by multiple independent clonessuggesting consistency and robustness of this humoral response inprostate cancer patients (FIGS. 2B, C). In 1985, Liao and Witte reportedthat that 37% of males and only 3% of females had significantautoantibodies to androgen receptor (Liao and Witte, Proc Natl Acad SciUSA 82, 8345-8348 (1985)). Males older than 66 more often hadhigher-titer autoantibodies to androgen receptor than younger males orfemales.

To validate the observations we made using phage-epitope proteinmicroarrays, an ELISA was generated using three of the phage epitopeclones including the 5′-UTR_BMI1, eIF4G1 and RPL22. Phage particles werepurified and coated onto 96-well plates for subsequent incubation withrepresentative sera from prostate cancer patients and controls. As shownin FIG. 2B, prostate cancer patients produce a humoral response to theseepitopes relative to controls. Titration of the humoral immune responseto the 5′-UTR_BMI1 clone is shown as a representative example in FIG.2C.

In order to validate the 22-clone epitomic profile, an independentcohort of sera from 48 clinically localized prostate cancer patients(pre-prostatectomy), 14 prostate cancer patients (post-prostatectomy),11 hormone refractory prostate cancer patients, 15 age-matched controlsand 10 lung cancer patients was employed. A prediction model was builtby a weighted voting algorithm using the 22 phage epitope profilederived from the “training” cohort of 129 samples (FIG. 8). As anindependent test cohort, a class prediction was made for 63 samples (48localized prostate cancer and 15 controls) using this model (FIG. 9). Intotal, only 2 out of 15 controls and 8 out of 42 cancers weremisclassified, which resulted in 87% specificity and 81% sensitivity. Anadditional 6 cancer samples were considered as unclassified due to a lowprediction strength (confidence) of 0.1 (See FIGS. 8, 9 and 10). Afterprostatectomy, the humoral response was generally decreased especiallyin patients that did not exhibit a recurrence suggesting that the immuneresponse is attenuated upon removal of the “immunogen”. 4/4 patientsthat exhibited PSA recurrence post-prostatectomy, also maintained the22-epitope humoral response. Only 3 out 11 patients withhormone-refractory disease exhibited a humoral response to the 22selected epitopes. This suggests that the humoral immune response isattenuated in advanced prostate cancer or those patients treated withanti-androgens and/or chemotherapeutics. To determine if this 22-epitopeprofile is specific to prostate cancer, sera from 10 lung cancerpatients was also examined. Only 2/10 sera from lung cancer patientsexhibited reactivity to the prostate cancer epitopes. This is incontrast to the over 80% sensitivity achieved for prostate cancerpatients using this platform, suggesting that the epitomic profile isprostate cancer-specific (proportion test, P<0.001).

To determine whether the four in-frame phage epitope clones (FIG. 3A)are dysregulated in prostate cancer, a meta-analysis of publiclyavailable prostate cancer gene expression data was performed (LaTulippeet al., Cancer Res 62, 4499-4506 (2002); Luo et al., Mol Carcinog 33,25-35 (2002); Luo et al., Cancer Res 61, 4683-4688. (2001); Singh etal., Cancer Cell 1, 203-209. (2002); Welsh et al., Cancer Res 61,5974-5978. (2001); Dhanasekaran et al., supra). This in silico analysissuggested there was ample evidence in multiple profiling studies forover-expression of the four in-frame phage epitope clones (FIG. 3B).Immunoblot analyses of benign prostate and prostate cancer tissueextracts demonstrated overexpression of these humoral responsecandidates at the protein level confirming the in silico analyses (FIG.3C).

To assess the expression of the humoral response candidates in situ,immunohistochemistry and immunofluorescence analysis was performed. Oneout of the four antibodies used for immunoblot analysis (FIG. 3C) werecompatible for tissue staining purposes. The antibody that wassuccessful for these applications was directed against the eIF4G1protein. Weak cytoplasmic staining of eIF4G1 was observed in benignprostate epithelia, and strong staining was observed in clinicallylocalized prostate cancer. These immunohistochemical analyses werefurther confirmed by immunofluorescence staining for eIF4G1. A strongcytoplasmic staining of eIF4G1 was observed in prostate cancer epitheliaas compared to negative staining in benign epithelia.

In summary, the present example describes a robust approach of combiningphage display with protein microarrays to detect cancer based on theendogenous humoral immune response. As this approach relies on amultiplex set of markers, it is less likely to suffer from the drawbacksof monitoring single biomarkers such as PSA.

Example 2

Breast Cancer Detection by Epitomic Profiling of the Humoral ImmuneResponse

This Example describes an investigation of the humoral immune signaturein breast cancer. The phage display breast cancer cDNA library waspurchased commercially from Novagen. The library was enriched for breastcancer specific phage epitopes using a pool of IgG from 10 breast cancersera and 10 normal controls. A total of 2,304 phage clones were pickedand printed on slides to make a high-density phage epitope microarray.By applying this platform, 77 sera samples were screened, including 42breast cancers and 35 normal controls. The images and data were analyzedand normalized as for prostate cancer (See Example 1). In order to builda predictor, a total of 28 cancers and 24 controls were randomlyselected and assigned as training set, and the remaining 14 cancers and11 controls served as test set. The best performing clones were selectedfrom the training set by t-test with 1000× permutation. A total of 21clones were selected with 81% specificity ( 5/24) and 79% sensitivity (6/28). When applying these 21 phage epitopes on independent test set,the same level of accuracy was achieved with 91% specificity ( 10/11)and 50% sensitivity ( 7/14).

Example 3

Humoral Immune Response Profiles Associated with Diagnosis and Prognosisin Lung Adenocarcinomas

A. Construction of Phage-Epitope Protein Microarray

The approach described above for profiling of prostate cancer (SeeExample 1) was used to identify epitomic biomarkers of lung cancer (FIG.13). To develop a phage display library for lung cancer, total RNA wasisolated from 7 lung cancer tissues (3 lung adenocarcinomas and 4squamous). The phage library was then enriched by affinity purification(biopanning) using individual serum samples from 6 adenocarcinomas, 4squamous and 3 non-cancer controls. Thus, a total of 13 enriched phagelibraries were created. After four rounds of biopanning, epitopes thatspecifically elicit a humoral immune response in lung cancer patients orcontrols were enriched for. Totally, 2304 phage-epitope clones wereselected randomly from the 13 biopanned libraries in order to generateepitope microarrays. Once in a microarray format, these enriched phageepitope clones were used to interrogate serum samples for humoral immuneresponse markers.

Using this high-density phage-epitope microarray platform, sera from 150lung adenocarcinomas and 101 non-cancer controls were evaluated. Asdescribed above (See Example 1), a two-color system was employed inwhich a green fluorescent dye (Cy3) was used to measure levels of thecapsid 10B fusion protein as a control for protein spotting, and a redfluorescent (Cy5) was used to measure levels of bound IgG. Therefore,increased Cy5/Cy3 ratios represented varying levels of immunereactivity. After normalization, data were used for subsequent diagnosisand survival analyses. Results are shown in Tables 1 and 2.

TABLE 1 Clinical information for Training/Test set samples Training setTest set Adenocarcinomas (n) 75 75 Age average (year) 63.6 66.3 Agerange 44–90 34–88 Male 37 37 Female 38 38 stage I–II 57 59 stage III–IV18 16 Dead 35 33 Alive 40 42 survival time (ms) 31.5 32.4 No-cancercontrol (n) 50 51 Age average (year) 60.8 60.8 Age range 36–77 40–77Male 30 31 Female 20 20

TABLE 2 Prediction accuracy of training and test sets Training set Testset Sensitivity 82.7% (62/75) 82.7% (62/75) Specificity 94.0% (47/50)84.3% (43/51) Accuracy 87.2% (109/125) 83.3% (105/126)

For diagnosis analysis, 251 samples were first randomly assigned totraining set (75 tumors and 50 controls) and test set (75 tumors and 51controls) with matched age, sex, stage and survival (FIG. 13 and Table1). In the training set, t-test combined withleave-one-out-cross-validation (LOOCV) was performed to build a classprediction model, and the top-ranked 59 epitope clones were selectedbased on their best performance on 82.7% ( 62/75) sensitivity and 94.0%( 47/50) specificity (Table 2). Prediction sensitivity and specificitywere computed based on the number of misclassified samples in the cancerand control groups. This prediction model consisting of 59phage-epitopes was then applied to the independent test set. The testsamples were correctly classified into cancer and normal groups with82.7% ( 62/75) sensitivity and 84.3% ( 43/51) specificity, respectively(Table 2).

In order to investigate the predictive performance of the immuneresponse profile, receiver operator characteristics (ROC) analysis wasperformed using the 59 phage-epitopes derived from the training set toassess the prediction accuracy in the test set. The discriminativeability of the panel of 59 phage-epitopes between cancers and controlswas statistically significant (p<0.0001) with an area under the curve(AUC) equal to 0.88 (95% CI=0.82 to 0.94) (FIG. 14).

A leave-one-out cross-validation approach was performed on entire 251samples (150 tumors and 101 controls) to select the best diagnosisrelated phage epitopes. The top-ranked 113 clones were found to give thebest predict values with 83% ( 125/150) sensitivity and 87.1% ( 88/101)specificity.

B. Humoral Immune Response Profiles Predict Survival

The association between phage epitopes and patient survival was nextinvestigated. First, the 150 cancer samples were randomly assigned to atraining (n=100) set and test set (n=50) with matched stage anddead/alive. LOOCV with Cox proportional-hazard regression model was usedto select the survival related epitopes in the training set. An epitoperisk index was created from 7 top-ranked survival related clones basedon median cutoff point of the index, which give the best overallsurvival prediction in the training set (P=0.004, FIG. 15 a). The riskindex and cutoff point were then applied to the test set. This riskindex of the top 7 clones correctly identified low- and high-riskindividuals within the independent test set (P=0.02, FIG. 15 b).

In order to select the most robust set of survival related clones, theLOOCV approach was used to identify epitopes associated with survivalfrom all 150 tumor samples. A risk index of the top 8 clones cansignificantly separate 150 patients to high- and low-risk groups (mediancutoff point, P=0.0008, FIG. 15 c). This risk index can also predictpatients with stage I, Ia or Ib cancer (FIG. 15 d, e and f). Furtheranalysis with univariate Cox model showed that patient stage, T or Nstatus were also related to survival, but age and sex were not (Table3). To analyze whether this epitope risk index is an independent factorfrom other clinical variables, multivate Cox model was performed on age,sex, stage and risk index. The result showed that this epitope riskindex is an independent survival predictor (P=0.003, Table 4).

TABLE 3 Univariate Cox's proportional hazards model Variable P value Age0.96 Sex 0.48 Stage II 0.02 III–IV  <0.0001 T status 0.02 N status <0.0001 Epitope Risk index  0.0008

TABLE 4 Multivariate Cox's proportional hazards model Variable HR 95% CIP value Age 1.02 0.999–1.05 0.06 Sex 1.13 0.693–1.85 0.6  Stage II 2.611.233–5.54 0.01 Stage III–IV 5.89  3.352–10.35  <0.00001 Epitomic riskindex 2.23 1.328–3.76  0.003C. Identification of Phage Epitopes

The phage display peptide microarray strategy allows for the easyidentification of humoral response targets by sequencing and BLASTsearching. The top 400 clones identified by previously LOOCV analysisbased on all samples were sequenced (Table 5). Some sequences were foundto be in-frame of known protein sequence, such as ubiquilin 1, nucleoarprotein 3 (NOL3), alpha-2-glycoprotein 1 and heat shock 70 kDa protein 8(HSPA70). Most of the humoral immuno response peptide targets weremimotopes.

Among the in-frame known proteins, heat shock 70 kDa protein waspreviously reported to be a humoral immune response target in lungcancer by another group. Two different sizes (113-197 and 113-219 CDSregion) of HSP70 were found with the same humoral immune responsepattern. Three clones of nucleoar protein 3 and alpha-2-glycoprotein 1were uncovered respectively although the serum antibody to NOL3 wasdecreased in tumors as compared to no-cancer controls and this humoralimmune response was related to an unfavorable survival in lungadenocarcinomas (P<0.006).

A total of 9 clones with 2 different sizes (112 aa and 125aa) of UBQLN1were found in this study. The mRNA was increased in lung adenocarcinomas(FIG. 16 a). Two forms of protein were found by 2D Western blot, ofwhich the native form was increased in tumors as compared to normal lungtissue and the phosphorylated form was decreased in tumors (FIGS. 16 band c). A second phosphorylated form of UBQLN1 was found in normaltissue only.

TABLE 5 Sequence Identity for phage clones associated with diagnosis andprognosis. Clone Associated with Translated Protein SEQ No. of IDDiagnosis Prognosis Sequences ID NO Clones Protein Identity 12G5 XPGLIPGFTPGLGALGST 1 7 Ubiquilin 1 GGSSGTNGSNATPSEN TSPTAGTTEPGHQQFIQQMLQALAGVNPQLQ NPEVRFQQQLEQLSA MGFLNREANLQALIAT GGDINAAIERLLGSQPS 12G9X QIQQGLQTLATEAPGL 2 2 Ubiquilin 1 IPGFTPGLGALGSTGGS SGTNGSNATPSENTSPTAGTTEPGHQQFIQQM LQALAGVNPQLQNPE VRFQQQLEQLSAMGF LNREANLQALIATGGDINAAIERLLGSQPS 7A2 X NSLESYAFNMKATVE 3 2 Heat shock 70 kDaDEKLQGKINDEDKQKI protein 8 (HSPA8) LDKCNEHNWLDKNQT AEKEEFEHQQKELEKVCNPIITKLYQSAGGMP GGMPGGFPGGGAPPS GGASSGPTIEEVD 18D11 X GAYPSTYDLDIEVHGG4 2 hypothetical LQPCLELEYGAEPIVGI protein OB1516 KGSLDSLASEEATMKVESWGSRKHEALYCIQ NTEI 4C10 X QAFPQQTGRRATSEPT 5 2 PREDICTED: AM similarto Coagulation factor II receptor precursor 2D5 X VTRPPSGRRPPTS 6 2PREDICTED: similar to B-cell receptor-associated protein 29 17H12 XAVAQMRMRMKMRM 7 2 TPA: HDC18596 RMGQEGTQQEPQQQN ILEDDTRDQGAHTGGPPGKPDADE 19G8 X QERQTRAQKKGTSSSG 8 2 putative protein HSTTKVIP 4C4 XGTEIDGRSISLYYTGEK 9 1 Nucleolin (NCL GQNQDYRGGKNSTWS protein)GESKTLVLSNLSYSAT EETLQEVFEKATFIKVP QNQNGKSKGYAFIEFA SFEDAKEALNSCNKREIEGRAIRLELQGPRGSP NARSQPSKTLFVKGLS EDTTEETLKESFDGSV RARIVTDRETGSSKGFGFVDFNSEEDAKAAK EAMEDGEIDGNKVTL DWAKPKGEGGFGGRG GGQACGRTRVTS 11G4 XLGTAIGPVGPVTPIGPI 10 1 Ubiquilin 2 GPIVPFTPIGPIGPIGPT GPAAPPGSTGSGGPTGPTVSSAAPSETTSPTSE SGPNQQFIQQMVQAL AGANAPQLPNPEVRFQ QQLEQLNAMGFLNREANLQALIATGGDINAA IERILGSQPS 5B4 X AERVSETWYMKGTVQ 11 1 apolipoprotem BHCDFN 22A10 X AKHSSAYTFFHPHSNP 12 1 hypothetical VSHYHPRFI proteinUM00661.1 7D8 X ARWGLRMG 13 1 acetyl-CoA acetyltransferase 7G8 XCCLPRFTESTSV 14 1 similar to ENSANGP000000 05259 8D5 X GELKGKEK 15 1adenine phosphoribosyltran sferase 1, APRT 13D2 X GKVGGGFLI 16 1COG0730: Predicted permeases 22F5 X GPQTDRPPQDRRPRHA 17 1 hypotheticalPCPQEGCVPLESNAGR protein MCA0617 PHNLLSDYSCDKSPGR SMTRG 17D3 XGSRGQEFKTSLANMV 18 1 PRO0478 KLHLY 1H8 X HLHNPGDPCRVMSQR 19 1 PREDICTED:PL similar to VPS1O domain receptor protein SORCS 3 18A7 XHPWAPKGWARWGAA 20 1 PREDICTED: PWAAGWPGTPALSAG similar to Zinc TPKLAAALEfinger protein 43 22C1 X IISRRGTNTAPLTSSSA 21 1 hypotheticalTTRTPARLWCCRS protein FG05539.1 1E8 X IKTKENMLREARQKG 22 1 hypotheticalLVTNGSPSD protein 6B5 X IRIAPLEVKFLDRRKTD 23 1 solute carrierQSESICQECFH family 9, member 4 4D1 X KKKDNL 24 1 COG0628: Predictedpermease 4E8 X KKTSGPDGFTGERYQ 25 1 ORF2 contains a XI reversetranscriptase domain 2B6 X KYWRSIEDRKI 26 1 cytochrome D ubiquinoloxidase subunit II 2G4 X LELQRQSSL 27 1 spalt4 13F6 X LEPSFSANYHKDKKTP28 1 PREDICTED: HVLTHRWELNNENTW similar to TQEEEQHTLGPVL glycogenin 213F9 X LIFRGNGQGMREGNKK 29 1 hypothetical protein AN5619.2 1B8 XLLLKLEPISQQ 30 1 glycosyl transferase, group 2 family protein 1F4 XLRQEDCLNPGGRGCSE 31 1 KIAA1556 protein PRSCHCTPAWATE 7E6 X LRSHAWWWT 321 trbI 10G2 X LSISCL 33 1 hypothetical protein FG08221.1 2C6 X MVLVNLKP34 1 heparan Sulfate- glucuronic acid-5- Epimerase (hse-5) 7F9 XNKTPSVPHNHFSLIK 35 1 PREDICTED: similar to zinc finger protein 300 8B6 XNSCILKEDKDILKKPL 36 1 asparagine-rich NSRFSSNSKVKNMRLL protein, putativeEHSTFSAPLNRVM 7E10 X NSDFYDFFHK 37 1 Hypothetical protein CBG01255 2D10X NSEGRLLS 38 1 Hypothetical protein ZC443.6 2F9 X NSFDLVGTGGLEESRL 39 1TPA: olfactory SIPWPLGSLLYAKSPRK receptor OR11-50 3C5 X NSKESI 40 1ATP-dependent helicase 3D1 X NSKNTVLQLDSVRSM 41 1 immunoglobulinSESRAITT heavy chain variable region 2B9 X NSLPGLPSLYFVSMAK 42 1GH05757p HKNNTSTTIS 7B7 X NSPNTLFRSASTKPK 43 1 genral secretion proteinE 2C5 X NSQECLSQILLIPSSCL 44 1 ENSANGP000000 KKNICV 11065 7E11 XNSRLRGIL 45 1 COG0330: Membrane protease subunits, stomatin/prohibitinhomologs 4B6 X NSVFLPFINMFIRKWY 46 1 sensor-histidine HSEHISYILFFFCVWIFkinase VanSc TLR 11D1 X NVTRVFK 47 1 hypothetical protein 7A10 XPASTLKGQDARNRLT 48 1 similar to AF15q14 QK protein isoform 2 1B12 XPIHMCYTGAKKEGCF 49 1 CIR protein, VGKSS- putative EEVPRTWLLSLKGDGVNSPCWGSY 13D1 X PQIASHSLFLLPRVLST 50 1 hypothetical SIIS proteinGZ28G717 5A4 X PQMTKTKRTHKNI 51 1 FP588 17A5 X QAYVNV 52 1 COG1538:Outer membrane protein 8B3 X QEASVSGLKMKSMST 53 1 S2 gene productKQVWNQIAFDEKGSG FWRLYFRCCYNASSN QD 6A6 X QTCKQLQFLPFAS 54 1 PREDICTED:hypothetical protein XP537924 7B10 X RMTYLWGLNHKPTDN 55 1 putativepermease VNCHSQFLP (MFS superfamily) 5D5 X RSQFQQGNVPVQSRLR 56 1hypothetical protein having cryptosporidium- specific paralog 2B3 XRVTPTAEQSPIPGCRK 57 1 TonB-dependent receptor 1A8 X VCSSSIHRSPQVERVSP 581 PMF31 PHHFPEEQT 3D5 X VESASLHLDCF 59 1 hypothetical protein BH115603B7 X VGGGRASGRIANGCWA 60 1 AMPA GLutamate Receptor subunit (glr-2) 1A10X VPIQMPPEATCVT 61 1 hypothetical protein Bcep02003282 6D2 X VSNSMKI 621 ORFveg109 1F6 X VVSGSGHLERSQDCGE 63 1 likely glycerol-3- KGNIFQphosphate dehydrogenase 20A12 X AHSPTKGCQICQDQEK 64 1 putativeretroelement pol polyprotein 20D12 X AHSRRKTAGN 65 1 recombinationactivating gene 2 6G7 X EHIPAPASPRFSIQGS 66 1 PREDICTED: similar toHypothetical protein 4832420M10, partial 10D10 X GNRDPVAC 67 1 TPA: 52 K17H8 X GPWHQMPSPTKGWLG 68 1 flagellum-specific RISQ ATP synthase FliI15B6 X IAHSGSSVF 69 1 Niemann-Pick disease, type C1 15A12 X IQCVYKPNSHFV70 1 Similar to RIKEN cDNA 4930429O20 19B12 X IYISLNVVTLKACTLKF 71 1ENSANGP000000 GCINATFNLN 25688 23E12 X LFYGGMGGWKNGSR 72 1 NIb proteinASEAD 15E9 X LLQRNTVPQKQRNKA 73 1 PREDICTED: GWRMTLTS similar to ankyrinrepeat-containing SOCS box protein 5 16H8 X LPSVARRSPGLGPQLR 74 1parathymosin-lilce QQGGCGPVCHHHQDI PPPQGLPFPLAPSPFL 8B12 XNSALGNHGEGKPIVEC 75 1 two-component LLRC system, sensor protein 6H3 XNSASSKCPSY 76 1 hypothetical protein PMM1351 21G10 X NSFKAIRK 77 1 CDC27D-618 protein 17H10 X NSFLEGEEQIL 78 1 hypothetical protein LIC1195014E12 X NSSVTLMRQRVTMMG 79 1 DNA RHTT topoisomerase II 21C12 XPDWDAVVQSWLTAAS 80 1 ADAM 32 NS precursor (A disintegrin andmetalloprotease domain 32) variant 16C7 X PRRTGEGAPPARLARR 81 1PREDICTED: AGEVEHERTC similar to testin 17G10 X SKLSKGYEKLVF 82 1putative transcriptional regulator 16E8 X TMPKGNVKLGN 83 1mitogen-activated protein kinase kinase kinase 3 isoform 2 8F11 XVITLIYR 84 1 hypothetical protein OB0069 16H11 X X GPEGSEAVQSGTPEEP 85 3nucleolar protein 3 EPELEAEASKEAEPEPE (apoptosis PEPELEPEAEAEPEPELrepressor with EPEPDPEPEPDFEERDE CARD domain) SEGIPEGQSSDRRCPA HAG 16E9X X PQCREKTKEN 86 1 tripartite motif- containing 7 isoform 4 16B11 XSGMPRRYSDYPDAYTT 87 1 cytochrome c oxidase subunit I 16E11 XDVRVSIHKHILG 88 1 nucleolar protein 3 (apoptosis repressor with CARDdomain) 5E11 X GKRRDSFFSF 89 1 hypothetical protein AM638 14E11 XLETIILSKLAQEQKTKH 90 1 putative p150 RMFSLISGS 16G11 X NSPSVGLFTH 91 1MUP1 10G9 X NSRLYQKYKN 92 1 similar to GG9996- PA 5E3 XPARLARRAGEVEHERTC 93 1 hypothetical protein Magn028940 16F11 XSLTSTASDGDYSARTVM 94 1 COG0568: DNA- directed RNA polyrnerase, sigmasubunit 10G11 X TQSPTTLNVAGTPQQ 95 1 IgG kappa light chain variableregion 21C5 PSQLKCSPSANVKMG 96 14 glycine GGKGLKIRENCMHLRT decarboxylase13E11 GERGKRTFQKESDTAL 97 11 BRCA1 protein ILRECPICL 11A1NSLEWTKVYLGKKIW 98 7 FAM53B protein TPEKGNSSYK 13B3 RPQTDRPPQDRRPRHA 995 PSIP1 protein PCPQEGCVPLESNAGR PHNLLSDYSCDKSPGR SMTRG 19H9GQQRKPCLGGKKKT 100 3 CGI-143 protein 22A9 NSTATTSSSSLKDPGSR 101 3Oncogene EMS1 RPSWTSLAKERSQEQA KRNLEFQSPTLSPPMK ATLSKPS 16B9 PCSKH 102 3Siah2 protein 15D6 QERPSETIDRERKRLV 103 3 nucleolar protein 3ETLQADSG...EPDFEER DESEDS 13C8 RICPTHTKPQNTVPLH 104 3 FAT tumorLLRPTIDQL suppressor 2 precursor 12E5 WVSEPHCVVVNM 105 3 Kinesin-likeprotein KIF13B 15E3 GAGTGARARARAGAA 106 2 ALEX2 protein LTWS 17A7ILLMRRRMTRMSGGA 107 2 CREB-binding EQTQTMQMGVKTK protein 17B10LHHIGQQHPQRFWHQ 108 2 telomerase RPLS catalytic subunit 18H6LMRVLKTEVTGYQEV 109 2 EF hand domain CTPKRNWNSRQE family, member A113E12 NSLIQHQHLGQI 110 2 ZFP-95 19G6 NSQGLDFSKATLRSRQ 111 2 TIP30 RL18F11 NSSDSLRIVWLLSDVY 112 2 CCAAT/enhancer ESFLHLPFQISHCSWY bindingprotein KYLS alpha 14H12 NSSPADLPCRIC 113 2 UbcH 7-binding protein 21E12RTPSSPCWPPGPVLAE. 114 2 nucleolar protein 3 ..EPEPDFEERDESEDS (apoptosisrepressor with CARD domain) 13A6 RVPKQRYRSMEQNRA 115 2 tumor-relatedLRNNAVYLQLSDL protein DRC2 10G3 STKKMGTQALSKAAP 116 2 kringle-containingHG protein 15B12 TRSGSSSWAVLTGARP 117 2 HSPC017 KRLCAATFPNMEKS 8G11AEEYRLQRHYCSY 118 1 Pleckstrin and Sec7 domain containing 2 23D12AESTPVQDPSIFCEYST 119 1 Chain B, Binary PTSMGGGK Complex Structure OfHuman Tau Protein Kinase I 11A2 AEVPILFIPP 120 1 solute carrier family 4sodium bicarbonate cotransporter-like member 10 17A1 AGGSFSPWPVLLPPPPP121 1 frizzled 8 GGKSGHNRGQRPH 10D4 AHIRTKDSTNCI 122 1 TRIM14 isoformalpha 6F3 AICSIL 123 1 10E5 AIGKIAKNNP 124 1 SFRS protein kinase 2 16E4ANNLLNGGLYTGKPY 125 1 RAD51D CGN 10C11 ANQLNELNPK 126 1 9G11AQGPRCAGCTGKGRT 127 1 TAG 6D4 AQVLCHIEDQVPDQIL 128 1 PGVPLELLGEFCQESGRRK 12B5 ARGPSWRSNELWLHH 129 1 LSSSSRHLMSS 1A11 ASCYLTSNCTFITRVQ 130 11F11 ASRKIWYELNSGYAE 131 1 WRTEEAIRRSGRHQVQ 1E7 ATLSV 132 1 4E5 AVYFFKAK133 1 13H8 AWYKICKICL 134 1 14B9 AYNKFLHL 135 1 21A12 CWPGWSQTPDLR 136 17H8 DEWKNTFQGELKGLKC 137 1 14A5 DKKFLIETSI 138 1 7G7 DVFNTVGPLGWSVFH 1391 PQTNADQNGVF 1G7 ECQGQC 140 1 6G6 EEEHSDKYVLSLLMNS 141 1 LSLRS 6G2EFFLMTIGKN 142 1 17G8 EKEKNLNCFFGRTTTK 143 1 KR 7A5 EKLATSMYLQNIPNWR 1441 LSSESEVSME 9F11 ELESCCVTQAGVPCYD 145 1 LCSLQPPSPGFK 12H4 ELLFL 146 121B8 EMLNGGRVLWM 147 1 12B3 EQLQT 148 1 4F10 ERKVF 149 1 8B1 ETSIKYT 1501 17G12 GAGKFLREKEKEISLG 151 1 LMLGK 8E5 GCLG 152 1 1F7 GCLGFWGRG 153 115C3 GEACLSTATSW 154 1 6G12 GFLTMERKKITPPTTK 155 1 TYISTLPTDSIKQLRNGDYKATS 7C9 GGCDHCRDTTHGGCG 156 1 HCGLRGNIPSRPPDLQD CLC 3A6 GIFFVSKI 1571 3A1 GIGNVKDGRHGESF 158 1 14A11 GISPTKEDVIHSDVQD 159 1 ELVHSACYVCI 23F5GKHEGEG 160 1 3C10 GKIDERGRQGGRERD 161 1 RNRDRERQRERE 17B6GKPKRHWDERAAGGL 162 1 1A1 GKPTPLIQ 163 1 9F9 GKVKELNKEVREKKG 164 1KIKQYNTXQKGKKSR RQCKNS 7E7 GLPLWRRERVKVMR 165 1 5G11 GLWWKRKYLHLNTRE 1661 KHSQKLLCDDCIHLTE LNIPIDRAVWKHSCCG MCKWRFGAL 24D5 GMST 167 1 21D2 GNYAK168 1 21D12 GNYARQ 169 1 11H11 GPAFVLMKPGASPYPI 170 1 LALTLITNQMLQNKSNNDPN 1F9 GPFCHQRSGNPRIHHQ 171 1 HSQAHPWSGLQEACT SGTQRDSEICHEGDGN SRCAH8G5 GPTSN 172 1 21H11 GQHYPNTKARQKITTR 173 1 KL 10F2 GQRLIIING 174 1 1G3GRCVVATEINSRNRDS 175 1 ACQEFEFRV 13G1 GRGRTRWGMGMLLK 176 1 KIQ 3E1GRPGIGATHSSRFIPLK 177 1 19F5 GRVPFTFFNLSL 178 1 2B10 GTSSSHIDPLSRLPKLN179 1 LSRGGVWASWVK 3H10 GVERVAYSIHPASPTS 180 1 VSHSLVERMAMAPPVMESMRSPPQSTRPRVP LS 17B12 GWGRRIA 181 1 6D6 HCHCLPDLP 182 1 3G10HILSSTCCFLTF 183 1 7D6 HLWAQHHSVSSLKGR 184 1 TTLEYF 17B4 HTFKNTWELKNENTW185 1 TQGGEYHTPGPAGGF GGKGRESIRTKI 8F4 IASYM 186 1 16G2 IDLKSNL 187 112G12 IFRN 188 1 4F5 IGTRDQGKRLRMK 189 1 7G1 ILLQGYPGSSSTSLRPH 190 1 SSN16E3 INQKYTWLDKSHYAL 191 1 TTNASS 4F11 IQNSKKS 192 1 17C8IQSATELVGRLGMHPR 193 1 IQSATELVVS 14B10 IRASNQYRSSVKYISVH 194 1 6A3ITPRAVFWY 195 1 20D10 IYFKKKKT 196 1 7H2 KDHAQSNKYLTSL 197 1 4E9KGMNKTSKNCGTM 198 1 15G5 KGTFFRSGSLGCK 199 1 2G11 KIYNI 200 1 4D5KKAERSTK 201 1 1C8 KKEESSSRMWPL 202 1 22C12 KKHFICTSFLDLGYTV 203 1 PVY12D2 KSFCRIFLCW 204 1 20B6 KSTAHSLCKGLM 205 1 11H2 KTTIF 206 1 21E6LAYVSNSHQGKFGWL 207 1 SGLSR 7G11 LDGMLAAQTEEDPET 208 1 15F4LETEAGESLEPRRWRLQ 209 1 22D3 LEVRISRPSWLTR 210 1 13A12 LHKPQSQWTR 211 14A9 LHQNPKGLGSESFWIT 212 1 LPGR 20C1 LKDVTVSVRLAPLYISM 213 1 14F2LKIIENCLNPGGRGCSE 214 1 SRWCRCTPTRTTE 10A9 LKQILSSVLNSEIELLL 215 1 9H8LLHMAAARRSAEQRG 216 1 KSPS 7C2 LLPQPPE 217 1 16G1 LLSHLQDWQHH 218 1 12G1LLSKSLRNEDTAVV 219 1 7B8 LQTGKEKASHPPPTLF 220 1 SPIIYNNTDLRAVKVILKYYIKWVRRE 14G11 LQVTLPRRGRDTCGSH 221 1 REATER 16G12 LRIT 222 1 23B7LRLSTPWPTLKPHLKG 223 1 KVMSL 16G10 LSESIWFAFHFDDCK 224 1 15F5 LSHGTG 2251 1C11 LTRNDI 226 1 11B9 MKEYA 227 1 11D12 NELWLHHLSSSSRHL 228 1 MSS10C12 NGCVYLSKFKL 229 1 mci domain family, member 2 17A3NKEREVFSTNGTGYIPH 230 1 GKKRTTQ 15D12 NNQK 231 1 1E4 NRGKHRG 232 1 4A5NSACL 233 1 1C12 NSAQN 234 1 8D1 NSASTEPSTNRLQLPW 235 1 VGGLMQTGRLPGSLTA18D4 NSASTRPISHIRRRTLL 236 1 SSA 11B10 NSDLVRHQFKGKTTL 237 1 KVH 5D4NSDQIQNTGAESREKV 238 1 RMSITADEFVG 3E4 NSDVI 239 1 3B8 NSECTCIIVKGNTFSPC240 1 KFIV 4D2 NSEG 241 1 13H12 NSEGAT 242 1 2A7 NSEQQRLKELKSEHTN 243 1NKKVKQPCC 15D1 NSESNSFASKNKFN 244 1 21B1 NSFCVCVFNSQS 245 1 8C2 NSFGFST246 1 18C9 NSFLLEIQEPSLGVWIR 247 1 TPFL 10E11 NSFLSF 248 1 11F3NSFPSSICFNS 249 1 1E10 NSFQGLQDYLIKSSMN 250 1 TRHDE LVL 15F11NSFRKQRHWKG 251 1 6C6 NSFRL 252 1 20E10 NSFRPHRFKSNA 253 1 7C12 NSFRYFA254 1 11E7 NSGVSW 255 1 9E3 NSHCDI 256 1 4C9 NSHNPKLEK 257 1 7A3NSIHHVLLSLHPPLYK 258 1 3A2 NSIHM 259 1 22C3 NSIIPRAIWLSVERMW 260 1 QLRW2A6 NSIKCKKM 261 1 12H7 NSIKRFSASCVARICPG 262 1 18D6 NSIL 263 1 17E4NSILIKYGDTWN 264 1 1G10 NSILQSAGESFLLHNL 265 1 NLCS 2G3NSITHLEKHTILYTNSS 266 1 TK 3A4 NSKETSSNGTEWNPH 267 1 17B5 NSKGRRV 268 19E4 NSKIAR 269 1 21H6 NSKIMFSKMFLSQITE 270 1 19H5 NSKQRFFLKKK 271 1 17C5NSLCGICI 272 1 7G11 NSLKKL 273 1 19H7 NSLLCLICLT 274 1 10B2NSLNKIQNTFESSTID 275 1 21B4 NSLPLT 276 1 10B10 NSLPWKQKV 277 1 Chain A,Structurally Distinct Recognition Motifs In Lymphotoxin-B Receptor AndCd4O For Traf- Mediated Signaling 12D7 NSLS 278 1 11H12 NSLSFADWFWKRS279 1 5H5 NSLSSFHCSSHCF 280 1 8B2 NSMMDHVTNNATGM 281 1 NIMEK 4G1NSMSMPRLCGRMKEC 282 1 VPATNAPTSTS 13C9 NSMVVTATSYSTPIPE 283 1DRLSTRGKEQMPHEMS 7E5 NSNEE 284 1 22E11 NSNPYPGGRSTSGDPK 285 1FKPRNCSVPQWLGYN PFWP 4F2 NSPAGISRELVDKLAA 286 1 ALE 1C6 NSPASAS 287 110B1 NSPKMGSPSLLKYYT 288 1 9D1 NSPKMGSPSLLKYYT- 289 1 RS 6A2 NSPPAN 2901 3D4 NSPSQPACLGAQR 291 1 5F1 NSPVPSVTTDYQNISLLT 292 1 10H10 NSQAVCIFF293 1 21H10 NSQNVFNSSSFHFMAL 294 1 ERYRRK 1H5 NSQRLIWLSN 295 1 14H6NSQVGLSSSYPQ 296 1 3D3 NSRCHCPA 297 1 8A1 NSRFDF 298 1 11D4NSSDITLIEKKELIKANI 299 1 TAK1-like protein 2D11 NSSFLMT 300 1 4E11NSSFLQGALVPLSGE 301 1 17D6 NSSGLLKVSLLKYHPS 302 1 FMNSRGFSLQVL 16G8NSSRQPHPLLTSLNILYI 303 1 3B10 NSSRTAFSFHSLLLL 304 1 10G5NSSSSQHREHEKEEKY 305 1 HGDF-related pro 2 4D7 NSSSSSNPILSHGTTKN 306 1KVCSAPEALYAGDGQ LNENLKGKPSGLRCVP LRDFT 17A9 NSSSYRPQRVWCGSIC 307 1SRASTGIPIPQGLPPKY LAFKELSYLNSAGTSC 7F8 NSSV 1 18C5 NSSVTLMRQRVMMM 308 1dipeptide ABC GRHTT transporter, dipeptide-binding protein 11H8NSSWHIRSQGEDNRET 309 1 ALVYRKQIFSETLHYY 20E7 NSTDK 310 1 16B6NSTGNMKGIIILTFQLK 311 1 RMGKPTPLLF 1D4 NSTR 312 1 19A2 NSTSKSVEHS 313 19A3 NSTVLKYVTLPHLRE 314 1 5F2 NSVCV 315 1 10C6 NSVIIESLVVNV 316 1 1C7NSVNFILIPLDLEG 317 1 12C8 NSVQGRAVLLCHGLT 318 1 GRAWFYLYGLFCV 6C2 NSVVH319 1 4E4 NSVYMI 320 1 3F3 NSYCVNQAGLELLASS 321 1 DPLALASGMLGL 1H4NSYLFSR 322 1 1D12 PAWATKSKTPS 323 1 13H6 PGLGEWCRVCV 324 1 6B10PGRIHLAEAQHGHPRP 325 1 CLHSEVFS 3E6 PHATSHLRVKHEISQIQ 326 1 HPPLLS 14F11PISLRGATAGRAERIRE 327 1 EEVRGAVHHKRIH 7B1 PQRTTLNFLLGQPARL 328 1PLGLSVGDRPTSQGR 1B9 PRIFPSSAQQRMK 329 1 5E11 PSRPPRRGGGARAHVL 330 1GPERW 1A9 QGHTGVSHK 331 1 1B7 QKTKHRIFSLIGGN 332 1 2A2 QMLLLPAI 333 13E12 QRSRVAEGWRGPLNP 334 1 ELTPKCIDPSMHGWR 20F1 QSLPPARNCNKPDSML 335 11E9 QVPRVLPQHRLGLAG 336 1 GADD45 gamma EEAGAPSIPATDHRRL RSGQL 2E2QVSGPPSKI 337 1 2H3 QWLTPVIPTLWEAKA 338 1 breast cancer GG suppressorelement Isbmael Upper RP2 2B4 RALQQLRIHPDLHLQR 339 1 RSQAQQHQGGQDS 14E10RAVRREASHRPSPPLA 340 1 SRRPLDALS 4D8 RDDSDYSVE 341 1 18F10RECTRCRRKTESTAQR 342 1 VKKPATLLASVKPPAN AVSTM 1B5 RGPKRLL 343 1 20G3RISILKR 344 1 18E11 RIVRVTPRRSWNHYET 345 1 IESKE 8G6 RLGPQARHG 346 118F2 RLHR 347 1 1E3 RMKQIVRKVEPIMT 348 1 19D3 RMMSSSIQSLRKAGSEP 349 12E7 RNWNKPSKRNCP 350 1 8C11 RPQP 351 1 14C10 RPQTDLPRTDVPGTLL 352 1PSIP1 protein VLRRAASPWSPTRGDP ITCCLITVVISPREGA 13B11 RPTDRQTSPGQTSPAR353 1 SLSSGGLRPPGVQRGA TP 2F6 RQDCF 354 1 19C12 RRLLGLYMVL 355 1 6F7RRRLW 356 1 14B8 RRSRPSWPTG 357 1 1D7 RRWTKAHCK 358 1 10H11 RTLKAEVEKGSM359 1 20E12 RVPFTFFNLSL 360 1 22B12 SFSRG 361 1 12A11 SLSSTHFDICAGSGGR362 1 RSTKCKGLSTSVQCVY EEAH 23H10 SNEGLKEVKISTCRLS 363 1 KQSVSKLLNEKKS15F12 SNSHSPSTQGSLDCVF 364 1 QETHLIWSDFVSPPKS HLEL 6D9 SRRMA 365 1 12E11SRSASFMVGTTTVSDR 366 1 LRTSDFRS 2H5 SXARXPIQRESRMGD 367 1 13D4TLPGLRTPVSTRPTGTV368 1 PIPPIL 1G11 TPTRDTSVMQIEETGR 369 1GKESSTMVVATTIHHG EATGTISMSSTGTRTTI MGTGDIWMPTVPEAI DPYITCPERGLMTSTSLRPHSSN 15H6 TRLAWDLNWKLNVV 370 1 2A10 TRPPSGRRPPTS 371 1 7H12 TVLFGV 3721 21H4 VAQRPAGPVGWAAG 373 1 GEALIG 1E11 VFEDLKKYLKF 374 1 putativeprolyl oligopeptidase 20F12 VFTVVISTSGARCQRQY 375 1 8C10VGSWERAGGPPRGEPP 376 1 PVPAPCLSAPPRCS 24H12 VGTIY 377 1 4E6 VGVGIILS 3781 2D6 VHYHNINNLVK 379 1 21D5 VIGSLMGMALNL 380 1 16A12 VKKLVVGSWERAGGP381 1 PRGEPPPVPAPCLSAPP RCS 17D12 VKNYF 382 1 9G3 VLLYLKR 383 1 8C3VPGHARWLTPIIPALR 384 1 DAEAGGS 9D8 VVCSISLLSF 385 1 2E8 VVFLR 386 1 14A9VVQTESLKSPSTYRCA 387 1 QQDQVTSSSDCHHK 3E11 VVVVVETGAI 388 1 1G1VYGRNYDGI 389 1 13A3 WELNSEKTWTQGGEH 390 1 HTPGPLWGRGARGGI ALG 16D10WKKNSRCY 391 1 22H4 WKSGRS 392 1 24F10 WMQSKYSKKSCCYVYG 393 1 11F5WPPELRLLTDQWQHSI 394 1 LMGM 20H3 WPPSSGPDCRFTHAIKL 395 1 16B7WRSSFPSTIYGKD 396 1 19A1 WSGWPT 397 1 11F11 YWTNPPTLTIPRHHLS 398 1 TVLA

Example 4

Humoral Immune Response Profiles Associated with Prognosis in ProstateCancer

This example describes the investigation of association of phage epitopeclones with prognosis of prostate cancer. The prostate cancer cDNA phagedisplay library described in Example 1 was biopanned using a pool of IgGfrom 16 prostate cancer sera (7 samples with Gleason=6 and 9 sampleswith Gleason=8 and 9). After construction of phage epitope microarrayplatform, 32 sera samples were screened. Raw data scanned werenormalized as described in Example 1 for prostate caner diagnosis. Inorder to identify the phage clones for prognosis, the samples wererandomly assigned to a training set (31 samples) or a test set (11samples) with an equal proportion of samples having the same Gleasonscore. T-test combined with leave-one-out cross validation was appliedon the training set. Low risk patients with a Gleason score ≦6 and highrisk patients with a Gleason score≧8 were considered as two groups. Atotal of 21 clones were selected based on their best performance on thetraining set with 100% specificity ( 13/13) and 62.5% sensitivity (⅝).When applying these 21 phage epitopes on an independent test set, itsperformance was shown to be 100% specificity ( 4/4) and 75% sensitivity(⅚).

All publications and patents mentioned in the above specification areherein incorporated by reference. Various modifications and variationsof the described method and system of the invention will be apparent tothose skilled in the art without departing from the scope and spirit ofthe invention. Although the invention has been described in connectionwith specific preferred embodiments, it should be understood that theinvention as claimed should not be unduly limited to such specificembodiments. Indeed, various modifications of the described modes forcarrying out the invention that are obvious to those skilled in therelevant fields are intended to be within the scope of the followingclaims.

1. A method of screening for the presence or absence of prostate cancerin a subject, comprising: a) providing a sample from said subjectsuspected of having prostate cancer; and b) detecting the presence orabsence of a humoral response to a tumor antigen selected from one ormore of the group consisting of Eukaryotic Translation Initiation Factor4Gamma 1 (eIF4G 1), Ribosomal Protein L22 (RPL22), Ribosomal Protein L13A (RPL13A), and protein XP_373908 (SEQ IDNO:402), wherein saiddetecting comprises detecting the presence of an autoantibody to saidtumor antigen; wherein the presence of said humoral response isindicative of the presence of prostate cancer.
 2. The method of claim 1,wherein said detecting comprises detecting the presence of anautoantibody to said Bromodomain Containing Protein 2 (BRD2), the 5′untranslated region of B lymphoma Mo-MLV insertion region 1 homolog(5′-UTR_BMI1; SEQ ID NO:407) or hairy and enhancer of split 1, homologof Drosophila (HES1).